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Erratum. 
Subscribers to Biometrika are requested to make the following correction in 
the paper by Mr Frank Sandon in Biometrika, Vol. Xvi. upon the “ Simplification 
of the Calculation of Abruptness Coefticients,” p. 194, 1. 11. In the value for py’ a 


Qu,/h 


N 


negative sign has been dropped before the term * in the square brackets. A 


computer recently wasted much time by not observing that the sign of this term 


was in error. 
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l. Introductory. The craniology of the ancient Egyptians has been more in- 
tensively studied than that of any other race of man and perhaps no other 
people has been accredited at different times with such varied origins. Since 
Morton (1), who supposed them the posterity of Ham, furnished descriptions and 
measurements of dynastic crania, the craniometric material has been slowly 
accumulating, although only a few of the series dealt with in more recent years 
can satisfy all the requirements of biometry. The present paper provides a notable 
addition to our eitieden in dealing with hitherto undiscussed measurements of two 
series of crania which are preserved in the Biometric Laboratory. One is of 50 
and 2 skulls from Royal Tombs of the First Dynasty at Abydos, measured by 
Mr G. H. Motley, and the other of 900 ¥ skulls from Gizeh belonging to the 26th 
to 30th Dynasties, measured by Miss A. Davin and other workers in the Laboratory, 
Both series were excavated by Sir Flinders Petrie. The latter is the only one 
available which is long enough to give really adequate representations of a 
statistical population, but there are several others sufficiently long to give quite 
reliable mean measurements. An attempt has been made to collect from published 
sources the mean measurements of all series of Egyptian ¢ crania from prehistoric 
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to Roman times which contain more than 20 specimens*. In estimating the inter- 
relations of such abundant material the need for the greatest possible refinement 
in method is at once felt. The differences between the various types are in many 
cases so small that it is impossible to determine their affinities from a comparison 
of isolated characters; the true sequences are obscured by errors of random 
sampling. In such cases the method of the Coefficient of Racial Likeness, which 
was originally suggested by Professor Karl Pearson and of which extensive use has 
now been made, becomes an indispensable aid. That criterion takes cognisance at 
one time of all the principal measurements of the skull and it gives a measure of 


the relationship between two series compared as a single numerical quantity. 
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Fig. 1. Diagram of the Inter-relations of Ancient Egyptian Series of Crania. 
In calculating this coefficient attention is focussed on the characters which differ 
most profoundly. Without some such criterion as the Coefficient of Racial Likeness, 
which can give a mathematical precision to our appreciation of the degree of 
resemblance between allied cranial types, it would have been almost impossible to 
determine the true relationship of the numerous ancient Egyptian series and to 
reduce them to any uniform plan. The comparison of the Coefficients leads to the 
arrangement of the types shown in Fig. 1. 


It is surprising that so many German craniologists writing in recent years, and with quite a lively 
sense of the need for full measurements of as many skulls of a home 


neous population as possible, 
should be content to publish individual measurements without means. 1 iiave had to caleulat 
of nearly all the material dealt with in this pape 
Laboratory. 


the means 
with the exception of the series in the Biometric 
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2. Map showing sites from which the cranial series dealt with in this paper were obtained, 
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In his Presidential Address to the Anthropological Section of the British 
Association in 1923 Professor P. E. Newberry said: “It is remarkable that so 
little is known about the early history of the Delta. Few excavations have been 
earried out there, and nothing of Pre-dynastic, or Early Dynastic, times has, so far, 
been brought to light from the country north of Cairo” ((2), p. 183)*. There are no 
measurements of Pre-dynastic skulls and very few of Early Dynastic specimens from 
Lower Egypt, but the long series of 26th to 30th Dynasties skulls from Gizeh makes 
our knowledge of the physical type prevailing there in later times quite exact. 
For the district around Thebes and Abydos in Upper Egypt the craniometric 
material is far more ample than for any other region of the same size in the world. 

A detailed comparison of all the ancient Egyptian material leads to the following 
main conclusions : 


(i) In Pre-dynastic times there was a primitive, dolichocephalic race living in 
Upper Egypt which may have been directly descended from the people whose 
palaeoliths are found on the high desert floor on either side of the Nile Valley. 
Though quite distinct from the main population, it bears certain relations to that 
more advanced type which suggests that the two had a common origin. This 
so-called Aeneolithie+ race may have been assimilated into the 
Upper Egypt if its numbers were smati 
of modification of the skull form of the 


main stock of 
, but there is not the slightest suggestion 
iatter resulting from such an admixture. 
At any rate the Aenecolithic type had disappeared from the Thebaid before the 
opening of the historic era. 

(ii) In Early Pre-dynastic times there were two distinct races of man living in 
Egypt; one in the Thebaid and the other, it is supposed, in the Fayum. These 
may be called the Upper and the Lower Egyptian races. They were as closely 
related to one another as two adjacent peoples are generally found to be, and there 
can be no doubt that they diverged from the same branch of the human tree at no 
very early date. The earliest direct evidence we heve of the Lower Egyptian type 
is in the persons of the royal retainers of the First and Second Dynasties who were 
buried in Abydos, but it is probable that they were intrusive in Upper Egypt as 
the type is quite distinct from that of the contemporary native population. The 
earliest series of Lower Egyptian type found in the Fayum belong to the Fourth 
Dynasty, but the presumptive evidence in favour of the view that the population 
there was of almost precisely the same type in much earlier times is very strong. 
The Upper Egyptian type is known directly from crania found in the Thebaid and 
dating from Early Pre-dynastic times. 

(iii) The Lower Egyptian type seems to have remained unchanged from Early 
Dynastic to Ptolemaic times except that a relatively small part of the population 
was modified very slightly, possibly by admixture with some unknown foreign race. 

, 


All the sites from which skull series dealt with in this paper were obtained are shown on the 
accompanying map (Fig. 2). 


t In using the term Aeneolithic, as in Italian 
Flinders Petrie, 


usage, I am adopting a suggestion made by Sir 
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(iv) The Upper Egyptian type was slowly transformed from the very earliest 
times in which we have acquaintance with it, and by Late Dynastic times the 
population of Upper Exypt was of almost pure Lower Egyptian type. 

(v) The pure Upper and Lower Egyptian types bear the following relations to 

one another : 

(a) All characters measuring lengths in the sagittal plane, except the basio- 
alveolar length GL (viz. L, LB, 8S, 8,, S., S;, S,’, fml and G,’), facial breadth 
characters (GB, O,', G, and NB), AZ and BZ of the fundamental triangle, 
the profile angle, occipital index and palatal index are identically the same 
for the two types. 

(b) The facial heights GH and NH’, foraminal breadth (fmb) and upper 
facial index (100 G’H/GB) of the Lower Egyptian type are slightly greater 
than those of the Upper, but the height index 100 H’/L is just appreciably 
smaller. 


(c 


Considerable differences are found between the characters Q’, U, J, B’ and 
cranial capacity, all being in this case primarily dependent on differences 
between the calvarial breadths, and the orbital height and index are greater 
for the Lower Egyptian type, while, in keeping with the latter difference, 
the nasal index is less. With its much greater calvarial breadth the Lower 
Egyptian skull has the smaller basio-bregmatic height and its basio-alveolar 
length and nasal angle are also smaller than those of the southern type. 

(d) The greatest differences are between the values of the calvarial breadth, 

the cephalic index, which is entirely dependent on B as LZ is constant, and 
the index 100 B/H', the latter being a peculiarly distinctive character as 
Band H’ bear opposite relations to one another. 

We conclude then that the calvaria of the Lower Egyptian skull has a greater 
breadth, a rather smaller height and the same length as that of the Upper Egyptian 
type. The facial bones of the two, other than the zygomatic, are equal in breadth 
but the broader skull has the rather longer face. 

These two types represent the extremes of the pure native Egyptian population 
from Early Pre-dynastic to Ptolemaic times and the relations between them are of 
the same nature as those which two allied, adjacent and contemporaneous races 
would bear to one another in general. The vast majority of the Dynastic series of 
crania from Upper Egypt are intermediate between the Upper and Lower Egyptian 
types. Characters—such as those related to the calvarial length—which are the 
same for the extremes are constant for all and where the greatest differences are 
shown the transition from one to the other can be clearly traced. The series 
represent various stages in the transformation of one type into the other. The 
types shown in the following table were selected because each represents an adequate 
number of crania* and shows no signs of intermixture with any race foreign to 
Egypt. The order in which they are arranged is that deduced from a comparison 


No mean measurement shown in this table is based on fewer than 36 skulls and all those not 
marked with an asterisk are based on more than 50, 





Series 
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of the Coefficients of Racial Likeness between them. Every series came from 
Upper Egypt except that of the 26th to 30th Dynasties which represents the 
pure Lower Egyptian type. 


Measured by | B  100B/L| 100B/H’ Hl’ J \100NB/NH’| Nz | GI L LB 
Early Pre- |) Thomson &) 137.4 | 74-7 981* | 134-0*| 124°3*| 50°3* 67°:0*| 98°8* 183°5 102-0% 
\ Maclver ) 


dynastic 


Naqada 4 
& & ( 1OUpss 


Late Pre- 
dy nastic 


{| Fawcett ... 132°7| 71°8 99°2 133°8 | 125°9* 50°0 


Thomson &) = “ * 
{ i « 33° a. OO! 32- a7. 51°0 66°°8 
\ Maclver § 1334 72°1 L00°5 132°7 | 1270 | 16 


65°°5*| 96°7* | 184°7 | 101° 
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| 
| 98°3 185°] LOG 
) 0) 2? a . . - ~ it - | 4 ad 
a aap 134°3 | 73°6 996 (1348 |1245 ] 50-4 65°°7 | 965 | 182°5 100°8 
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xv . 133°8 | 73°1 100% =|: 1330 | 126-4 188 65°°7 | 967 | 183-0 100°6 
rasties | 
Early and | | 
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nasties (#1 }) Toldt .. | 185°9 | 74°4 102°3) | 132°9 |125°7 | = 50°8 67°°5 | 98°1 | 182°7 | 100°1 
Kubanieh | 
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4 3 ale The Ss 5 QF - - © - © . OQ. _ 
pore , Mant eos 135°3 | 73°7 1029 (131° |1286 | 49°3 64°°3 | 95:0 = 183°5 | 100°7 
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ponte E. Schmidt 136°6 75:1 1036 =: 1319 | 128°3 50°1 661 | 96-4 — 181°9*| 100°6 
sHastles | 
Roman ) Thomson &) , ina - ee ; . 
: 36° 52 104° 30° 27 °5* 19-44 64°°3 | 94:2 81° 99°9 
Period (°" Mactver ( 136°4 io $4 1380°7 127°5 yr4 13 | 181 °4 
Ss 20 " a i o- SS dis. Si — = " “a . a. 
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No mean measurement shown in this table is based on fewer than 36 skulls and all those not 


marked with an asterisk are based on more than 50. 


The differences between adjacent types in the above table are so small that it 
is surprising to find that the sequences of characters are as regular as they appear 
to be. If the numbers on which the mean measurements were based had been 
smaller the orders would very probably have been hopelessly obscured for the 
characters showing the smaller differences between the extremes. All the characters 
show significant differences except Z, LB and Az on the extreme right of the 
table and the constancy of those three is very striking. 

(vi) The question how exactly the population of Upper Egypt was transformed 
from its pristine type to that of the Lower Egyptian race is one of considerable 
importance from both an evolutionary and an historical point of view. To answer 
it with any degree of finality we should require the evidence not of 5000 but of 
50,000 skulls and our present conclusions can be no more than tentative. The 
available series are not scattered randomly in time: there are very few for Pre- 
dynastic and Early Dynastic times, the later dynasties are well represented but the 
evidence relating to Ptolemaic and Roman times is meagre. The 18th to 20th 
Dynasties are far more adequately represented than any other period of the same 


length*. If the Pre-dynastic period was as long as the Dynastic, as is generally 


That is by the number of cranial series, but if the number of skulls is considered the 26th to 
30th Dynasties are far better represented than any other period as the long E series belongs to that time. 
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supposed, then the change in the Upper Egyptian type prior to the First Dynasty 
was exceedingly slow, but quite uniform as far as we can tell. Such a conclusion 
is quite in keeping with the archaeological evidence. In the early dynasties the 
movement became more rapid and by the 15th the skull form of the natives of 
Upper Egypt was approximately mid-way between those of the Upper and Lower 
Egyptian types. Either during or immediately preceding the 18th Dynasty there 
seems to have been a considerable acceleration of the movement. Four series 
belonging to the 18th to 20th Dynasties—two from Thebes and two from Abydos 
are of almost pure Lower Egyptian type, but like the Royal Tombs series of the 
First and Second Dynasties, to which they bear the closest possible resemblance, 
they still incline slightly towards the Upper Egyptian type. It is possible that 
those five series indicate sudden intrusions into Upper Egypt which were indepen- 
dent of the slower movement*. There are two other 18th and 19th Dynasties series 
which represent the slower and more normal transformation which pursued its 
course steadily down to Ptolemaic and Roman times. The rate of change after the 
Kirst Dynasty appears to have been on the whole remarkably uniform and there is 
ho suggestion of a retrograde movement that is to say a reversion to the Upper 
Kgyptian type—at any time. 

(vii) As examples of types which were entirely unaffected for several thousand 
years by any influences foreign to their country, the ancient Egyptians may well be 
unparalleled in the history of the world. All the series which do not appear to be of 
wbsolutely pure Egyptian type are underlined in Fig. 1 and, with the exception of 
Fouquet’s Aencolithic type, the divergences from the main stock are exceedingly 
small+, ‘The most aberrant series is Motley’s First Dynasty from the Royal Tombs 
of Abydos. It is most similar to the Lower Egyptian type, though inclining 
slightly towards the Upper Egyptian—the royal retainers being supposed intrusive 
in the Thebaid—but it also shows signs of admixture with an alien race which was 
certainly not Negroid. The calvariae of these First Dynasty skulls are not in any 
way peculiar, but the facial heights, and to a less marked extent the facial breadths 
are characteristically large. No other series dealt with in this paper has shown any 
tendency to diverge in the same direction. Three series from Lower Egypt are of 
pure Lower Egyptian type for all characters except the basio-bregmatic height 
which has values which are quite significantly greater than that of the 26th to 
30th Dynasties typet. An 18th Dynasty series from Abydos, which in all other 
Ways is intermediate between the Upper and Lower Egyptian types, is also 

It is quite probable that the 18th Dynasty series which closely resemble the pure Lower Egyptian 
type were representatives of the middle and upper classes of the population, while the lower social orders 
were closer to the pure Upper Egyptian type. Thomson and Maclver have provided measurements of 
two 18th Dynasty series. One from Shekh Ali was found in a pit and it undoubtedly represented 
an inferior class; the other, from Abydos, was obtained in a middte class cemetery. The Shekh Ali 
series is distinctly closer to the earlier type of Upper Egypt than the other. (See (13), p. 29.) 


\t different times during the Dynastic era there were undoubtedly quite large numbers of foreign 
slaves in the country. The only skull measurements ¢ 
crania of the Third Dynasty (8). 


f such people available seem to be those of pygmy 


t Sir Flinders Petrie’s field measurements of Fourth to Ninth Dynasties skulls from Qau indicate 
precisely similar divergence {see p. 29, ftn. + below). 
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characterised by a large basio-bregmatic height. Judging from means reduced by 
Lee from Schmidt’s individual measurements (see (6), p. 426), the modern Copts 
have diverged greatly from the type of the dynastic Egyptians and in no way more 
significantly than in their reduced length (2 =177°0) and greater cephalic index 
(100L/L =77°3). But the modern natives of Northern Abyssinia measured by 
Sergio Sergi (26) are surprisingly similar to the type which predominated in 
Upper Egypt during the middle dynasties. The relationship between one of the 
latter and the Abyssinians is decidedly closer than that between two series—the 
Whitechapel and Moorfields—of seventeenth century Londoners. This is a striking 
example of the persistence of a type with only slight modification for a period of 
at least 3000 years. There are two middle dynastic series which diverge slightly 





from the pure native population contemporaneous with them in the ways which 
were more accentuated in the later Abyssinians. 

(vill) It is very generally supposed that the population of ancient Egypt was 
sensibly affected at various times by the infusion of Negro blood. But in the series 
of which we have the mean measurements it is not possible to detect the slightest 
effect of any such admixture that can have taken place after early Pre-dynastic 
times. Apurt from isolated negroid skulls which are said to have been excluded 
from several of the series, the populations appear to be quite homogeneous and we 
have no reason to suppose that the mean type was affected in the slightest by 
admixture with any race foreign to Egypt. But it is quite possible that the race 
we have called the pure Upper Egyptian had originally some Negroid elements in 
its make-up. There are very few measured series of Negro skulls which are 
sufficiently long to give reliable determinations of type. A comparison of the 
more reliable material shows that there is a considerable variation in calvarial 
measurements among Negro races: all the Egyptian series fall within the extremes 
of length, breadth and cephalic index shown by the Negro types. But certain 
other measurements—chiefly facial—apparently distinguish all Negro from all 
Egyptian types. In the table on p. 9 the Upper Egyptian type is represented by 
the Early Pre-Dynastic and Nagada series and the Lower Egyptian by the 26th 
to 30th Dynastic series. The Negro races were selected because more or less 
adequate skull measurements have been furnished for them but for no other reason. 
All the measurements are of # skulls. 

The characters NH(R), NB, NB/NH(R), PZ, Nez and Bz clearly dis- 
tinguish the Egyptian from all Negroid types: they seem to be very constant for 
the latter. At the same time we find that for each one of those measurements the 
Upper Egyptian types are nearer to the Negro values than are the Lower Egyptian, 
and the Early Pre-dynastic is very slightly nearer to the Negroes than is the Naqada 
type. Much the same relations are found for the basio-alveolar length (@L), although 
the low value for the Congo series is discordant, and the indices 100H'/Z of the 
Upper Egyptian types are also between the Lower Egyptian and Negro values. 
With the facial height (GH) and index (100G’H/GB) the case is rather different. 


For those two characters the Bantu Negroes are almost identically the same as the 


ancient Egyptians, but the smaller values of the non-Bantu races differentiate that 
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group from the others although they are more similar to the Upper than to the 
Lower Egyptian values. The flattened occipital bone of the non-Bantu Negro 
skull is a most characteristic feature but we cannot say that in that respect one 
Egyptian race is more negroid than the other. From a comparison of facial 
measurements alone we can at least say that there is a distinct suggestion of 
negroid affinities in what has been called the Upper Egyptian race. The calvarial 
measurements do not support that view, and we cannot yauge the significance of 
the observed relationships while any knowledge relating to the skull form of a type 
which has resulted from the crossing of two races as markedly different as a pure 
Negro and a pure Egyptian is quite lacking. 

In dealing with the long-bones of the Nagada series Warren (9) found that 
there was a suggestion of negroid affinities *. 

(ix) A study of all the series of early Egyptian crania that have as yet been 
measured shows that they are inter-related in a simple way. It is most probable 
that the observed differences between the types are to be explained as being due 
to some slow moving evolutionary process—whether through selection or directly 
through environment—or to a slow blood mingling of different races. On the latter 
hypothesis it would only be necessary to assume the existence of two original races, 
wn Upper and a Lower Egyptian, which were distinct in Early Pre-dynastic times. 
The gradual approach of the Upper Egyptian population to the Lower Egyptian 
type—a movement, it should be noted, which was not completed as late as Roman 
times—would then be due to admixture in the Thebaid, while the original northern 
type was apparently persisting in the Fayum without modification. On the other 
hand, an evolutionary hypothesis would give an explanation of the facts which 
would be equally reasonable. Additional evidence, however, such as that which 
would result from a study of the variabilities of the series dealt with in this paper 
or the discovery of an adequately long Pre-dynastic cranial series in Lower Egypt, 
might tell strongly in favour of the one hypothesis or the other. li may be that 
the observed changes were the result of both racial admixture and an evolutionzry 
process. 

2. Methods of Measurement and Remarks on the Cranial Anomalies of First 
Dynasty Royal Tombs skulls from Abydos. The individual measurements taken by 
Mr G. H. Motley and “remarks” on the more outstanding cranial anomalies are 
contained in the Appendix to this paper. The methods of measurement used in 
dealing with the First Dynasty and the 26th to 30th Dynasties series from Gizch, 
which was measured by Miss A. Davin and others, are precisely those which are 
ordinarily taken in the Biometric Laboratory+. The majority are in accordance 

Warren says: ‘‘On the whole, the proportions of the limb-bones to one another may be said to 
have approached those of the Negro, while the sacral and scapular indices were almost identical with 
those of Europeans.” ((9), p. 191.) 

+ Full descriptions of the measurements are given in Biometrika, Vol. xiv. pp. 196—200. Briefly: 

I'=Ophryo-occipital length, L’=glabellar projective length, L =glabella-occipital length, B=maxi- 
mum parietal breadth, B’=least forehead breadth, 1/’=basio-bregmatic height, H=Frankfurt vertical 


height, OlJ=Frankfurt auricular height, L2=basion to nasion, ()=transverse arc through ‘‘ apex ” 


terminating at ear-rods of craniophor, Q’=similar to Q terminating at auricular points, S=sagittal arc 
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with those of the Frankfurter Verstiéndigung. Two additional measurements are 
given for the First Dynasty and for the re-measured Naqada skulls (see notes to 
Table I). One is the measurement of nasal height (VH/’) from the nasion to the 
base of the anterior nasal spine which is used by Broca, Flower and the majority 
of French and English craniometricians. NH’ is slightly less than the Frankfurt 
measurements VH, Rk and NH, L. The other (HOW) may be called the external 
ocular distance: it is taken between the points where the borders of the orbital 
ridges right and left meet the fronto-malar sutures (see pp. 22 and 23 below). 
HOW is the same as Martin’s innere orbitale Gesichtsbreite* : that and the dussere 


orbitale Gesichtsbreite are occasionally taken by German craniometricians. 
y y 


Remarks on the individual crania of the First Dynasty series are given in the 
Appendix. Unless otherwise stated each skull is complete with a mandible and of 


adult age. The following characteristics and anomalies were looked for. 


Signs of age. Condition of sutures, development and loss of teeth and the 
falling in or thinning of the calvaria. 


Condition of teeth. The teeth of this series were remarkably well preserved 
though all were worn. Many of the skulls that were ageing had not lost a single 
tooth. Hardly any cases of irregular formations round the palate were found and 
there was only one carious tooth. 


Palate. Of the 48 skulls only one has bony ridges across the palatine grooves 
leading from the pterygo-palatine canals. This is an unusually low percentage but 


it is quite possible that ridges originally possessed by other specimens had been 
broken. 


Pre-condyles. No pre-condyles were found at the basion or opisthion but there 


were several cases of projecting spines of bone in other regions. 
T'ympanic perforation. All cases were recorded. 


Base of the pyriform aperture. This feature is indicated by the letters P.n. 
The floor is described as “ flat ”—i.e. parallel to the Frankfurt horizontal plane— 


“sloping upwards,” meaning sloping upwards towards the pyriform border, or 


nasion to opisthion, S;=are nasion to bregma, S.=are bregma to lambda, S;=are lambda to opisthion, 
S;'=chord lambda to opisthion, U=horizontal circumference, PH=alveolar point to tip of anterior 
nasal spine, G’JJ =nasion to alveolar point, G2 =breadth between lowest points on zygomatic-maxillary 
sutures, J=zygomatic breadth, NH, R and L=nasion to lowest edge of pyriform aperture R and L, 
NB=nasal breadth, DS=dacryal subtense, DC=dacryal chord, DA 


dacryal arc, SS=simotice sub 
tense, SC 


simotic chord, O,, R and L.=breadths of orbits using Fawcett’s curvature method, O,’= orbital 
breadth from dacryon, O., R and L=heights of orbits, G;=palate length from point of spina nasalis 
posterior to an imaginary line tangential to the inner rims of the alveoli of the middle incisors, 
G,'=similar to G, from base of spine, G.=palate breadth between inner alveolar walls at second 
molars, GlL=basion to alveolar point, fml=length of foramen magnum, fmb=breadth of foramen 
magnum, P 2 =Frankfurt profile angle, Oc. I.=occipital index, NZ, AZ and Bz found from lengths 
LB, G’'H and GL with aid of Pearson’s Trigonometer, 0;=180°-Pz—-NZ, 0.=P2-Az, EH and 
EB=height and breadth of palate using Pearson’s Uraniscometer. The capacities were found by the 
method described by Macdonell (Biometrika, Vol. mm, pp. 203—206). 

* Rudolf Martin, Lehrbuch der Anthropologie, p. 552, Kd. 1914. 
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“sloping downwards.” The edge is said to be sharp, flat or rounded. Excluding the 
P Negro skull, the following frequencies are found for the combined ~* and ¢ First 
Dynasty Egyptians. 








Edge of Pyriform Aperture 
Totals 

Sharp Blunt | Rounded 
| sloping ywWardas 2 ) 
Floor of Pyriform oe J abcde ‘i wd x 12 4 <1 “ 
A verture at see eee eee ov ‘ a i 
= Sloping downwards . 5) 2 2 9 
Totals sv 26 13 { 43 











Asymmetry. SL or SR denotes whether the Sylvian depression is greater on 
the right or left side and in the same way JR and JL refer to the relative sizes of 
the jugular foramina. The following frequencies are found and the balance in 
favour of the right side is similar to that usually found in racial series. 


Sylvian Jugular 
Depression Foramen 


Greater on right side 14 20 
| Equal oe _ 28 10 
Greater on left side... 3 10 


There were few cases of calvarial or facial asymmetry. 

Malar bones. No horizontal sutures across the malar bones were found. 

Metopism. There were no cases of the persistent frontal suture. 

Ossicles in the sutures. These, as ordinarily, were confined almost entirely to 
the lambdoid suture. There was one ossicle of the X. 

Conformation at the pterion. All cases of epipteric bones and processes of the 
temporal articulating with the frontal were recorded. 

[nterparietal bones. There was one case of the simple interparictal aud one of 
the tri-partite interparietal with the os pentagonale fused into the supra-occipital. 

Other features. Cases of flattening of the obelion are recorded. In a number of 
the skulls a slight post-coronal depression was observed, but it was not very marked 
in any case. These skulls seem to be more symmetrical and to exhibit fewer 
anomalies than most racial series. 

3. Prehistoric Series of Kgyptian Skulls. The earliest series of prehistoric 
Egyptian crania of any length, and of which measurements have been provided, 
belong to a period in which metal was used. Palaeolithic implements have been 
found in abundance on the terraces of the Nile, but it may be doubted whether 
the country ever passed through a true Neolithic phase. The opinion of 
Sir Flinders Petrie ((4), p. 28) is that as soon as the sediment of the river had 


been deposited to a sufficient depth to make agriculture possible, an alien people— 


supposed Libyan—who were already acquainted with the use of copper, invaded 
the country and settled along the banks of the Nile. The autochthonous popu- 


lation at that time appears to have been one of palacolithic hunters and the 
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invaders did not exterminate either the people themselves or their primitive 
culture. It is unfortunately not yet possible to determine with any great accuracy 
the chronological sequence of the various Pre-dynastic series of crania that are 
available. In the case of the prevailing type an order can be given by using the 
method of Petrie’s sequence dates (see (4), pp. 4—12) which are determined chiefly 
from a comparative study of the pottery; but it is apparently not possible to 
establish with certainty the relative ages of the burials of the primitive peoples 
who maintained their lower stage of culture though living in close proximity to 
the invaders. All the skulls that can be dealt with came from a comparatively 
small area of Upper Egypt round Thebes and Abydos. There was another centre 
of civilisation in Pre-dynastic times in the Fayum*, but no measurements of early 
skulls from that region are available and there are none of the contemporaneous 
populations of Middle Egypt. 


The affinities of the series of skulls may be estimated by comparing the 
Coefficients of Racial Likeness+ between their mean measurements which are shown 
in Table I (p. 14). In the majority of cases the numbers of skulls on which the means 
are based are fairly adequate, but there are few series for which more than half of 
the 31 characters which should ideally be used have been provided. It is important 
to remember that a generalised measure of relationship which does not take into 
account several important regions of the skulls may be misleading and particular 
caution has to be taken when comparing such measures with those calculated for 
all the 31 characters. 


In the appendix to the first volume of J. de Morgan’s researches into the origin 
of Egyptian civilisation (10) Dr Fouquet gave individual measurements of 11 skulls 


that had been disinterred at El-Amrah near Abydos}. From the contracted position 


of the bodies and the nature of the grave furniture, which comprised abundant 
pottery but very little metal, it was reasonable to infer that the burials were of 
Pre-dynastic date though from the description of the artefacts that has been given 
it is not possible to assign them to any particular period of that era. The mean 
measurements of the nine specimens are given in the second column of our Table I 
The series in columns 3, 4 and 5 of that table are the longest provided by Fouquet§ 
in the second volume of de Morgan’s researches (11) and they were thought to be 
contemporary with the El-Amrah skulls. It is clear that the four series represent 
very similar if not identical types. The numbers of skulls in each series are small 
but the method of the Coefficient of Racial Likeness may profitably be applied. 
™ 1e Coefficients in Table II (p. 16) were calculated for 19 characters of which six 


were indices. The numbers in brackets are the mean numbers of skulls available 


for those 19 characters in the case of each series. 


A map showing all the principal Pre-dynastic stations in Egypt is given by J. de Morgan ((10), p. 68). 
+ See Biometrika, Vol. xvi. pp. 11—14 for a definition of the Coefficient of Racial Likeness and an 
account of the precautions that have to be taken when applying it in practice, 

+ All sites referred to in this paper are shown on the accompanying map (Fig. 2, p. 3). 
§ Two series from Naqada North and Guebel-Silsileh ((11), pp. 326 and 336 respectively) contain 
fewer than five skulls each and their means are not sufliciently like those of the longer series to make it 
safe to pool them 
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15 
Notes and footnotes to Taule I. 


The indices and angles in curled brackets were determined from the mean measurements of the 
component lengths instead of from the individual measurements. The capacities in square brackets 
were reconstructed by the formula given by Lee and Pearson (Phil. Trans. Series A, Vol. 196, p. 247): 
C="000266 x L x B x H+524-6c.cs. H’ was used in place of the slightly larger H, so the calculated 
capacities are likely to be rather smaller than the true ones. 

Through the kindness of Professor Karl Pearson, I was able to examine the Naqada erania dealt with 
by Faweett(6) and to take several measurements that had not been previously found. When first 
brought to England the skulls were in a very fragile state and it is no longer possible to take any 
measurements of several that were included in Fawcett’s tables of individual measurements. Excluding 
those skulls, I found a close agreement between Faweett’s measurements and my own for all characters 
except the following (see pp. 18—19 below): 


B B’ LB G' HI GL NH, 2 Gy O,R 
Fawcett | 134°5(122) | 91-2(130) 99-9(100) | 67°3(80) 95°0(68 | 49°1(84) 12-5 (82) 
Morant 133°4(110) | 92°5(122) | 101°1(87) 69°7(70) . 96°1(62) | 50°3(73) 41-6 (76) 





The disparity between the numbers of skulls on which the means are based cannot account for the 


whole of the differences between them and we are bound to accept the conclusion that the measurements 
were taken in slightly different ways. In this paper I have used my own measurements, and, besides 
the above characters, the angles of the fundamental triangle, the indices involving B, 100 G'11/GB, 
100 NB/NH, R, 100 02/0, Rk and 100 Gy/G, are altered from their original values. In addition to the 
means in Table I the following are now available for the two Naqada groups (see pp. 18, 19 below): 











L' H OH Ss S S 
A and ( Series 185°4 (87) 136°4 (71) 115°6 (98) 372°6 (90) 116°9 (85) 96°5 (84) 
B, T and KR Series 182°8 (20) 134°7 (23) 113°8 (26) 372°3 (20) 116°9 (23) 97-0 (23) 
Q' GH GB NH,R PU Low 
A and Q Series 302°9 (87) 69°7 (52) 95°9 (60) 502 (54) 20°1 (56) 96-1 (56) 
B, T and PR Series 307°9 (18) 69°6 (18) 95°1 (18) 50°5 (19) 19-9 (17) 94-4 (18) 
G Gy’ Gs O,N Ooh 100 0/0, R 
A and Q Series 52°5 (49) 18-0 (49) 10°7 (48) 41-6 (57) {77-4 (46)! 
B, T and R Series 50°7 (17) 46°1 (17) 40)°3 (19) 4$1°7 (19) (77-0 (19)} 
100 NB/NH,R 100 G./G, 100 G./G,’ 100 G’H/GB 100 2/L' 100 H/L’ 
i and Q Se ries }49°8 (54)! '77°5 (48)! {84 8 (48)! {72°7 (52)! 171-6 (87)} 173°6 (71)! 
B, T and 2 Series {49-7 (18)! '79°5 (17)! 187°4 (17)! {73°2 (18)! {74°5 (20)! '73°7 (20)! 
100 H/L L00 D/H Pe 0; 0 Oc. 1 
1 and () Series 173°8 (71)} 497°3 (71)! 84°°6 (50) 129°°9 (46)! $11°°3 (46)! 60°2 (84) 
B, T and RP Series 173°2 (23)} {101°0 (22)! 86°°1 (16) |29°°6 (16)  12°°2 (16)! 60°5 (23) 





Wrongly given by Fouquet as 184°6. | Wrongly given by 
37°7. S 29 


Fouquet as 88-1. 
2 ss 86°8, 
The means in this column are given by Chantre (15) in whole mms. only and it is probable that 
they were not calculated further, so they are not necessarily correct to the nearest mm. 


” ” ” 


The mean 
indices were calculated from these curtailed lengths and they may easily differ from the true means by 


as much as one unit. There were 24 3 skulls in the series but some lengths, and in particular H’ and J, 
could almost certainly not be taken on all of them. Chantre does not say how many skulls each mean 


was based on. The material is presented in such a thoroughly unsatisfactory way that it is practically 
worthless. 
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El-Amrah (7°7 ap CDawactens 
6 Indices 


cawamil (14°2 > . 
Kawamil (14 & Gedlecs 


Beit-Allam (8°3) : : 
: : 6 Indices 


vaqada S 25°¢ ; 
Naqada South (25°9 @ Indices 


the other nine characters are: 


100 B/L |\100 B/H 


Kawamil 0°07 0°23 

El-Amrah_ } Beit-Allam 3°66 2°47 

and nar 0-04 0-01 
Do l 

—e Beit-Allam 6°62 5:09 
Kawamil | aie 

and | ue 2% | 0°02 O54 


Beit-Allam f Naqada ) e 


and | South | 


than five and the low value for 


NgNg 


+ With the usual notation a 


difference when it is greater than 5. 


19 Characters ... 


19 Characters ... 


19 Characters ... 


No +N,’ 


TABLE IL. 


El-Amrah Kawamil 


(7°7 


198+ 
2°79+ 


100 NB/NH' 


2 ys 
1°86 


) (14:2) 


0°34+°'11 
0°716+°19 


“ll — 
19 

‘ll | O-°764°11 
19 | 1°65+°19 


‘11 | 0°48+°11 
"19 | 0°39+°19 
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Coefficients of Racial Likeness between Fouquet’s Pre-dynastic Series*. 


Naqada 


Beit-Allam South 
es) (25-9) 
1°17 + 11 198+ "11 
165+°19 | 2°79+°19 


0 
l 


l 


2 


‘76+°11 | 0-48+"11 
65+ °19 0°39 + °19 


164+ °11 
— 2°65 + °19 


16+°11 
"65+ °19 


* The numbers in brackets following the designation of the races are the mean numbers of crania 
available for the characters used in computing the Coefficients of Racial Likeness. 

Although the series are short, the lowness of the Coefficients in Table II may 
be taken to indicate very close affinity. ‘The Kawamil series most closely represents 
the mean type and the El-Amrah skulls stand rather apart from the others. Ten 
characters—viz. 100 H’/L, L, H’, LB, U, J, NB, O,', O. and fmb—of the 19 com- 


pared show no significant differences between the four series. The values of at for 





100 0,/0;' 100fmbj/fml B B’ | NH’ | fml 





2°07 1°46 
1°32 5°59 
11°53 1°24 
0°30 2°36 
5°17 0°07 
1°44 3°38 


the Naqada South series 


{M,—M,'| 
: «= 4 


2°60 O'Ol 67 


The orbital is the only index which shows more than one value of a greater 


is responsible for the 


divergences in that case. The small frontal breadth (B’) of the El-Amrah, the 
small nasal height (VH’') of the Naqada South and the large foraminal length 
(fml) of the Beit-Allam series are also more or less distinctive. But by far the 


and its value is taken to indicate a significant 


greater number of the cranial characters, and in particular those of the calvaria, 
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are so similar for the four series that they might represent identical types. It is 
ivasonable to suppose that three of the series—the Kawamil, Beit-Allam and 
Naqada South—are quite pure representatives of the same race: their pooled means 
are given in Column 6 of Table I (p. 14). The El-Amrah skulls diverge slightly 
from that type. The standard deviations of the three pooled distributions may be 
compared with those of the long series of 26th to 30th Dynasties skulls which is 
dealt with in a later section of this paper. 


E B 100B/L |  B’ | 100 NB/NH’ 
| 


=e =e | 


| 


| 

| 

| 

| ° 

| 26th to 30th Dynasty Series . | 5°72+-09 | 4:76+°08 | 268+°04 | 4:05+-06 3°82 + 06* 


Fouquet’s Pre-dynastic Series | 5°79+°36 | 506+°31 | 3°114°19 | 641+°39 | 1°69 +°35 


= so | 


* This S.D. is not of 100 NB/NH’ but of the very similar index 100 NB/NH, R. 


The population made up by combining Fouquet’s three series is certainly less 
homogeneous than that of the 26th to 30th Dynasties, but a comparison of the 
individual distributions suggests that the heterogeneity is inherent in each of 
them and is not due to the pooling. Provisionally we may assume that the pooled 
means are those of a pure racial type and we may speak of that as the Egyptian 
Aeneolithic type*. It will be shown that this primitive Aeneolithic race is most 
clearly differentiated from all the racial types that inhabited Egypt in Dynastic 
times and it must have been exterminated altogether, or, as the cultural evidence sug- 
gests, absorbed in a much larger population. The type is dolichocephalic and among 
African races it is distinguished by its large calvaria. Many modern races equal it 
in skull length, but apparently none but primitive types have as great a skull 
base (nasio-basial length) (ZB = 105°5). The Eskimo ZB length is 1049, the Aino 
105°4 and the Moriori 1056+. It is reasonable to suppose that this primitive 
Egyptian Aeneolithic race was directly descended from the people whose palaeoliths 
are found on the high terraces on either side of the Nile Valley. 

At least one other race was living in Upper Egypt at the time when the 
Aeneolithic people were still distinct in type. In the well-known work of Thomson 
and Maclver(13) individual measurements of 1500 skulls are provided. The 
material is divided up into series dating from Early Pre-dynastic to Roman times. 
Unfortunately the measurements are few in number and many of them are not 
defined precisely although there can be no doubt that they correspond exactly to 
measurements that are ordinarily taken in the Biometric Laboratory. 

The Pre-dynastic skulls are divided into two groups, an early and a late; the 
relative ages having been determined by the method of Petrie’s sequence dates. 
Of the ¢ skulls, 20 of the Early Pre-dynastic came from Abydos, 10 from El-Amrah 
and 25 from Hou: the pooled means are shown in Column 9 of Table I. Twenty- 


Although a few small metal objects were found in the early graves opened by J. de Morgan, 
he supposed that they belonged to a Neolithic stage of culture. 
+ These values are taken from Biometrika, Vol. xtv. p. 222. 


Biometrika xv 
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eight of the Late Pre-dynastic ¢ specimens came from El-Amrah and 99 from Hou; 
their pooled means are in Column 10*. 


Several large Early Pre-dynastic cemeteries were excavated by Petrie and 
Quibell in 1895 and an account of the work was given in the following year+(3). 
Faweett’s paper of 1901 (6) dealt with 300 of these skulls. They had been taken 
from at least four distinct cemeteries within a radius of three or four miles, but the 
grave furniture found at the different sites was so similar that the fact that 
a homogeneous population was being dealt with appeared to be conclusively 
established. An examination of the skull measurements led to the same con- 
clusion. The standard deviations of characters are quite as low as for populations 
which craniologists accept as homogeneous, though the majority of them are slightly 
greater than those of a long series—the FH series—of Egyptian skulls ef the 26th to 
30th Dynasties which will be dealt with in a later section of this paper?. A com- 
parison of the mean measurements of the series from different cemeteries suggested, 
however, that the divergences, though small, might be of significance when con- 
sidered in relation to the quantity of material that has become available since 
Fawcett wrote. By far the greater number of the so-called Nagada crania came 
from two cemeteries: one near the village of Nubt and the other two miles to the 
north towards Ballas§. These form the first series—the skull numbers not preceded 
by a letter and which for convenience we will term the A series—and the Q series 
respectively in Fawcett’s tables of individual measurements. The 7’ and B series 
were from two distinct burial groups within half a mile of the first site. The mean 
measurements of the A and Q series are almost identical and their combined 
means|| are shown in Column 7 of Table I. The 7, B and R series are very similar 
to one another and they agree W ith the A and Q series for the majority of characters 
but are yet sufficiently remote for others to make a further enquiry desirable. 
The splitting up of a relatively short series of crania into groups which are supposed 
more or less racially distinct, when the whole material came from a single district 
and was homogeneous from the standpoint of the archaeological evidence, is a 
practice that the biometric craniologist is bound to discredit unless very strong 
evidence can be adduced to support it. There are certainly not two distinct types 

* In the earlier work of Maclver(12) measurements of smaller numbers of Pre-dynastic crania 
were provided. The means of these—taking Early and Late periods together—were reduced by 
Miss M. A. Lewenz and Professor Karl Pearson and quoted by Fawcett ((6), p. 426). Individual measure- 
ments of 33 of the 44 skulls were given again in (13), so the earlier series is no longer of any importance ; 


it is obviously intermediate between our Early and Late Pre-dynastic types. Maclver gave orbital 


measurements that were not reproduced in (13), but when divided up the numbers become so small that 
I have omitted to consider them. 

+ It was orignally thought that the Naqada people were of Early Dynastic date but further evidence 
showed that they must be assigned to the Pre-dynastic period. 

{ The standard deviations of the Naqada, the H and three other series are compared in Biometrika, 
Vol. xiv. p. 208. 

§ The ‘‘ New Race” cemeteries excavated by Petrie and Quibell were four or five miles north of the 
village of Naqada. The sites are shown in (3) Plates I and Il. J. de Morgan worked in Nagqada itself and 
it was there that he found Menes and the Naqada South skulls. 


With the modifications of Fawcett’s measurements explained in the notes to the Table itself. 
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among the Naqada crania. There were two distinct racial types in early Egypt, 
but they were quite as similar to one another as adjacent racial populations are 
generally found to be. Nearly all our cranial series represent intermediate types 
between those two and the differences between them are very small; so small that 
with less abundant material it would have been impossible to say that they had 
a real statistical significance. Our contention is that Fawcett’s Naqada series of 
skulls contains representatives of two slightly differing populations which are both 
intermediate between the two extreme Egyptian types, and the evidence in favour 
of that view is partly deduced from a comparison of the two groups themselves, 
but its real justification is found in comparing their affinities with other early 
series. The Coefficients between the Naqada groups and Thomson and Maclver’s 
Pre-dynastic series are given in the following Table: 


TABLE IIT. 


Coefficients of Racial Likeness between some Pre-dynastic Series. 


Nagada 


' Naqada 
Sarlv Pre. f *re- nm 
Early Pre 4 and aq | Late Pre B,T andR 
| dynastic aoa dynastic Rita 
(40-9)* Series (1056) Series 
| . (66:1) (18-6) 
| = | 
Early Pre-dynastic | 14 Characters sake — 0°314+°13 | 2174+°13 | 355+°13 | 
10°9 6 Indices and Angles - 0°53 4°19 2°09 + °19 6°04+°19 
Naqada A and Q | 14 Characters vee | O3I+°13 073+°13 | 162+°13+ 
Series (6671 6 Indices and Angles} 0°53+°19 1624°19 | 3°25+°19 | 
Late Pre-dynastic | 14 Characters woe | VITH 13 = | O734+°13 1-66 +°13 | 
105°6 | 6 Indicesand Angles) 2°09+°19 1°62 +°19 3 40+°19 | 
Naqada B, 7' and | 14 Characters .. | 3554°13 162+ °13t) 166+°13 == 
R Series (18°6 6 Indices and Angles} 6°04+°19 | 3°25+°19 | 3404-19 


See footnote to Table II, p. 16. 

+ These Coefticients are based on 14 characters and 6 indices and angles so that they may be com- 
parable with the others in the table. When they are calculated for the full 31 characters, of which 12 are 
indices and angles, they become 0°65 + -09 and 1°54+: +14, so they are substantially reduced. The added 
characters show no significant differences. 


The low order of the Coetticients in Table III shows that we are dealing with 
closely related types. In comparing them it is important to pay due regard to the 
number of skulls available for each series—the number in brackets following the 
designation of the race. The Coefficients involving the Naqada B, T and R series 
are all symptomatic of greater racial divergences than those between any pair of 
the other three types, while the former stands closest to the Late Pre-dynastic. 
The Nagada A and Q group is almost identically the same as the Early Pre-dynastic, 
but between the latter and the Naqada B, 7 and R group there is a difference in 
type on which considerable stress may be laid as the numbers of skulls in the two 


92 
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series are small. The Coefficients arrange the series in the order shown in the 
table and that order is a perfectly consistent one. Turning to a comparison of the 
individual characters we find that 8 of the 14 compared—viz. 100 NB/NH’, L, H’: 
LB, G’H, NH’, NB and A Z —show no significant differences whatever. The only 
values of a greater than 5 are for the characters shown below. 


B \1OB/L \100B/H’, J \100H'|L| Nz 


| esky Pepdipmantic ( Naqada Aand @ ... | 2°45 0:07 1°63 3°93 | 2°39 | 4°14 
ee nd. Late Pre-dynastic... | 6°55 | 0°82 9°06 12°74) 753 O11 
. | Naqada B, Tand R , 15:13) 11°54 18°79 1°80 | 2°82 7°37 
| ———————EE “: 7 al = e 
Nagada A and Q@ {Late Pre-dynastic ... L14 0°62 3°85 151|) 1°76 1°81 
and | Naqada B, 7and &... | 8°96) 11°95 11°18 0°04} 0:42 L156 
| - es 
| Late —— Nagada B, Tand R, 6:00, 9-47 21 040! 0-02 | 7°83 


The most significant differences are between the calvarial and zygomatic 
breadths and the indices 100 B/Z and 100 B/H’ which vary with B while H’ and L 
remain constant. No stress can be laid on the single significant difference between 
the indices 100 H’/Z. The two values of a greater than 5 in the case of the nasal 


angle are of greater significance. The actual measurements are : 


| B 100 B/L | 100 B/H’ J H’ Nz | 
| 
| | 
Early Pre-dynastic 131-4 | 717 =| = 98°1 124°3 134°0 67°°O 
Naqada A and Q... 132°7 | 71:8 | 99:2 125°9 133°8 | 65° 
Late Pre-dynastic 133°4 72°1- | 1005 127°0 132°7 66°°8 
Naqada B, Tand R 136°1 74:0 | 103-0 126°2 132°2 64°°3 


The gradation shown in the case of the first three of these characters confirms 
the reasonableness of the order suggested by the Coefficients. H’, though showing 
no significant differences, exhibits the same order. The discordance in the sequence 
of zygomatic breadths may well be attributed to the errors of random sampling. 
The variability of the nasal angles would appear to be due to some other cause 
and it is almost certainly the result of slight differences between the methods of 
measuring the sides of the fundamental triangle employed by the workers concerned *. 
The differences between the two Naqada groups are precisely similar to those which 
distinguish them from Thomson and Maclver’s two series. From Early to Late 
Pre-dynastic times—a period of at least 2000 years—the change in the skull form 
of the population of Upper Egypt was exceedingly small. It may have been due to 
slow, progressive evolution or to slight admixture with another strain. A comparison 
of these Pre-dynastic types with Fouquet’s Aeneolithic will be of interest. The 
Coefficients are : 


Probably due to differences in the determination of the alveolar point : see Note p. 53. 
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TABLE IV. 


Coefficients of Racial Likeness between Fouquet’s Aeneolithic and 
other Pre-dynastic Series. 

















Early Pre- Naqada | Late Pre- Naqada 
dynastic d and Q dynastic B, Tand R 
(40-9) (66-1) (105-6) (18-6) 
: ; L 
All Characters we | 11-2844 | 9184-11 1071414 | 5284-11 
_., | Number of Characters | 11 19 11 19 
|Aeneolithic | Se one 2 | 
(48°7) , | - 
| 48° Indices and Angles ... 1°314+°24 | 0°29+°19 1°664+°24 | 141+°19 
Number of Characters { 6 4 6 


The most striking feature of Table IV is the great difference between the 
values of the Coefficients for all characters and for indices and angles only. With 
Thomson and Maclver’s and Fouquet’s series the only four indices that can be 
compared are 100 B/L, 100 H’/L, 100 B/H’ and 100 NB/NH’, while two others— 
100 0,/0,' and 100 fmb/fml—take part in the comparison with the Naqada groups. 
Only one difference of indices is significant; that is with the Naqada B, T and R 
series where a= 6°87 for 100 B/H’. The high values of the Coefficients for all 
characters are due entirely to the greater size of the Aeneolithic calvaria: facial 
direct measurements show no significant differences whatever. So the Aeneolithic 
type was almost identical with the Pre-dynastic Egyptian in size and shape of face 
and shape of calvaria, but it greatly exceeded the more advanced type in absolute 
size of calvaria*. The relation is a curious one and we can offer no explanation of 
it, but must consider the types to be perfectly distinct. The Aeneolithic people 
may have been absorbed into the larger population, but there is not the slightest 
suggestion that the latter was thereby modified in any way. 

In the last column of Table I the mean measurements of a series of Pre-dynastic 
skulls discovered at El-Khozan and measured by E. Chantre are quoted from 
(15), p. 57. The measurements are presented in such an altogether unsatisfactory 
way that it is not possible to determine the relationships of the type with any 
certainty+. From a study of the grave finds Chantre supposed that the skulls 
belonged to the same race and were contemporary with J. de Morgan’s series 
(see (14)) and he said that this conclusion was confirmed by the similarity of the 
skulls themselves. But the mean measurements are obviously very dissimilar to 
those of Fouquet’s Aeneolithic type. They apparently resemble the Late Pre- 
dynastic type more than any other. No stress can be laid on the significance of 
the small H' and J of the El-Khozan skulls as either mean may easily have been 
based on no more than half of the 24 specimens. 

Judging from the drawings provided by Fouquet in appendices to J. de Morgan’s volumes (10) 
and (11) the skulls of the Aeneolithic type had well-marked superciliary ridges, mastoids and inionic 


prominences and were decidedly more muscular than dynastic Egyptian crania. 
+ See notes to Table I. 
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A comparison of all the available skull measurements leads to the conclusion 
that in Pre-dynastic times there were two distinct racial types in Upper Egypt. The 
first and more primitive was characterised by a large calvaria and it was probably 
represented by a small and dwindling population. The second, which was carried 
down to historic times, was being transformed by some slow process of evolution or 
racial admixture. Both main types had low cephalic indices which distinguish 
them from the type of all later Egyptians. The apparent relationships of the 
various series are represented schematically in Fig. 1, p. 2. 

4. Skulls of the First to the Siath Dynasties. The first six dynasties cover the 
protodynastic era and the Old Kingdom, At the beginning of that period a change 
in the administration of the country took place which was likely to be of consider- 
able ethnic importance. The kingdoms of Upper and Lower Egypt had been ruled 
by one king before the accession of Menes, the founder of the First Dynasty, but he 
unified the country and facilitated the intercourse between the north and the south. 

All the available mean measurements of Early Dynastic series of skulls are 
shown in Table V. The series in the third and later columns is one which was 
measured by Mr G. H. Motley and the means are now given for the first time*. 
The collection consists of 48 skulls excavated by Sir Flinders Petrie in 1922 and 
presented by him to the Biometric Laboratory. They belonged to court-officials of 
the reigns of Zev and Zet, third and fourth kings of the First Dynasty. The 
bodies were buried in a square, the graves forming the border or sides, at Abydos 
between the Shuneh and the Coptic Deir. Each king had a fresh square of burials*. 
The full account of these excavations will be given in a volume entitled Tombs of 
the Courtiers, to be published shortly by Sir Flinders Petrie. 

Of the 48 skulls one (No. 443) was almost certainly that of a negro ; it is possible 
that some of the others showed, but far less definitely, negroid characters. Of the 
remaining 47 crania, 37 were allotted to the male, 10 to the female sex. Actually 
81 crania were obtained and in sufficiently good condition to be treated with 
paraffin preparatory to transport; through some misadventure only 48 were packed 
and reached England. A series of 15 measurements were taken by Sir Flinders 
Petrie in the field; seven of these, our Z (the maximum length), B (maximum 
breadth), 1’ (the basio-bregmatic height), ZB (the basio-nasal height), G’ H (upper 
face height, nasion to alveolar point), GZ (profile length from basion to alveolar 
point) and J (the bizygomatic breadth), are customary cranial measurements. The 
remaining eight measurements were taken for comparison with statuary, and it is 
less certain that they are: Orbital Breadth (? our O,), Jaw Breadth (? our condylar 
width W,), Facial Height (Nasion to base of jaw), Jaw Height (gnathion to infra- 
dental + height of teeth), Nasidental Height (nasion to line of teeth), Nasal Height 
(‘our NH’), Internal Orbital Width and Eaternal Orbital Width, both measured 

* Measurements of the individual skulls and remarks on cranial anomalies will be found in the 
Appendix to this paper. 


+ There is reason to believe that at least some of the interments must, by the attitude of the 
skeletons, have been cases of persons buried alive. An entire skeleton, to illustrate this, was brought 
back by Sir Flinders Petrie and is now mounted in the Museum of the Laboratory. 
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inside the orbits. As we invariably measure the external and internal ocular 
distances on the living subject, it seems highly important to include these among 
craniological measurements. I am inclined to think that the external orbital 
distance is better not taken inside the orbit, as then for comparison with the 
living we must allow for thickness of flesh; it would, I suggest, be advisable to 
take it from the inside border of one orbital ridge to that of the other, the 
measurement to be taken at the fronto-malar sutures, which can be closely 
approximated to in the living. The terminals of the internal orbital measurement 
are less easy to standardise. We have already two craniological measurements of 
this kind, namely DC the dacryal chord, and SC the minimum chord between the 
two naso-maxillary sutures, the so-called simotic chord. There is no doubt that 
the dacryal chord would better represent the internal orbital distance, and might 
be adopted for this purpose to save multiplication of cranial measurements*. If we 
add to the dacryal chord the widths of the two orbits we ave for the mean total 
2 21°56 + 40°65 + 40:09 = 102°30) 
i 22°82 + 42°77 + 42°33 = 107-92) ° 
external ocular distance as the three segments are not in the same straight line. 


But this is not 4 .1e measurement of the 


Using the measurement indicated above we find that the external ocular distance 
for males = 98°4 and for females 93°2 mms., while Sir Flinders Petrie’s measurements 
for these 37 and 10 crania are 96°4 and 92°1 mm., so that taking the measurements 
“inside the orbits ” makes a difference of 2 mm. in the male and 1 mm. in the female. 
The measurement from one fronto-malar to the other fronto-malar point seems to be 
somewhat in excess of the “inside the orbits” external ocular distance. It has the 
advantage, however, of being both an easily determined length on the skull and 
also a readily measured quantity on the living head. On the crania we have com- 
pared, Sir Flinders Petrie’s field measurements, when taken for the same characters 
as those used by us, agree reasonably, 0°5 to 1 mm. being the usual difference. 
A much greater divergence occurred in the sexing. We were at first much inclined 
to make the whole series male, but have finally admitted ten to be female. There 
is no archaeological evidence of sex, and failing this, we feel compelled to retain as 
a minimum four-fifths of the material as male. It is of course difficult with a short 
series of skulls, where we are not compelled by the circumstances of the interment— 
very peculiar in this case—to suppose both sexes represented, or represented in 
approximately equal numbers, to draw a marked line anatomically across the series 
separating it into two sex groups. It was indeed not hard to classify the material 
into two categories, and these might have been regarded as a sex-division; but 
they appeared to us to correspond far more to racial than to sexual differences. 
In other words we doubt the homogeneity of our material, we believe it to consist 
of a mixture of races, or better a race with individuals of more or less mixed blood. 
Our value of the dacryal chord for 34 4’s is 22°8 mm. Sir Flinders Petrie’s internal orbital width, 
measured ‘inside the orbits,” is 25-6 mm. for the same crania. This would only give a difference 
of 0:7 mm. in determining the distance of the pupillary centre from the sagittal plane and would serve 
for portraiture. 
+ The field measurements were taken before paraffining, but that process seems to have made no 
appreciable difference to the skulls. 
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Without venturing to be in the least dogmatic, on such slender evidence, we should 
expect a divergence from the prehistoric Egyptians by their mixture with a race 
of a coarser, stronger type. If the line drawn across the material corresponds to 
more and less of this stronger coarser race, it will be obvious that there is danger 
in such a small series of confusing such a racial line with a sex line. Were anyone 
to assert on other grounds than we have had before us that the whole series is male, 
there are certainly only a very few crania about which we should be inclined to 
contest his assertion. 

Measurements of two other series of First Dynasty skulls have been furnished 
by Thomson and Maclver (13). One came from a private graveyard and the other 
from the Royal Tombs of the First and Second Dynasties; both were from Abydos. 
The same writers have provided measurements of Fourth and Fifth Dynasties skulls 
from Deshasheh and Medum in Lower Egypt. The mean measurements are shown 
in Column 5 of Table V. 

The Coefficients of Racial Likeness between the above series were calculated. 
The numbers of skulls are fairly adequate but unfortunately there are no more 
than 14 characters, of which 6 are indices and angles, that can be used for that 
purpose. 


TABLE VI. 
Coefficients of Racial Likeness between Early Dynastic Series. 


| 


| lst Dynasty’ 1st Dynasty | 1st and 2nd 
Private Royal Tombs! Dynasties 








Tombs (Motley) Royal Tombs | = 
(33-6) (33:2) (21-8) | wedi 
| 
Ist Dynasty ... | 14 Characters ve 5:37 +°13 3°54+°13 5°59 +°13 
Private Tombs (33°6) 6 Indices and Angles — TIZ+°19 5°28+°'19 6-97+°19 
= BON S aw _ 2 
Ist Dynasty . | 14 Characters .. | 5374-13 0°66 +°13 2°80+°13 
Royal Tombs (33:2)... | 6 Indices and Angles T72+°19 0°36 +°19 1°62+°19 
Ist and 2nd Dynasties | 14 Characters .-» | 3544°'13 0°66+4°13 — 0°39 +13 
Royal Tombs (21°8) ... | 6 Indices and Angles | 5-28+°19 | 0°36+4°19 = 0-07 +'19 


4th and 5th Dynasties | 14 Characters ... | DSO+'1B 2°80 +°] 
(39°9) 6 Indices and Angles | 6°97+°19 162+°1 


3 0°39 +°13 
) O07 +°19 


The most striking feature of this table is the highness of the Coefficients 
between the First Dynasty skulls from the private tombs and the other series. 
The numbers of skulls being small, the Coefficients are of an order which must be 
taken to indicate perfectly distinct racial types. The two sets of skulls from the royal 
cemeteries are very similar to one another and are far more closely related to the 


later type from Lower Egypt than to the contemporary civil population of their 
own town. The Coefficient between the First and Second Dynasties series and the 


4th and 5th 


Fourth and Fifth Dynasties series is as low as it might have been if the two had 
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been drawn from the same population. Motley’s series, however, is distinct from 

that of the Fourth and Fifth Dynasties. From this evidence alone, which by itself 

would be very scanty of course, we may suppose that the courtier population of ; 
Abydos in the First and Second Dynasties was intrusive and that it came from 

Lower Egypt where the type persisted until at least as late as the Fifth Dynasty. 

A comparison of the individual characters of the four series can be made in the 

usual way by considering the values of the a’s between them. There are only four 
characters—viz. L, LB, NB and A Z—which show no values of a greater than 5. 

The a’s for the remaining ten are given in the table below. 


All the greatest values of a are in the first three rows of this table. The indices 
100 B/H’ and 100 B/L of the First Dynasty Private Tombs series are quite distinct 
from the values for the other three types and the divergences are evidently due 
primarily to differences in calvarial breadth. In the last rows there are only five 
characters with values of a greater than 5. In the whole table the only significant 
differences of GH and NH’ are those from Motley’s First Dynasty series. The 
mean facial and nasal heights of those skulls are greater, and in the majority of ' 
cases, quite significantly greater than those of any other ancient Egyptian series; 
and other characters, not provided by Thomson and Maclver, also distinguish 
them, so there is clear evidence of admixture with some alien strain. At the 
same time they stand very close to the other Royal Tombs series of the First 
and Second Dynasties and they are primarily of Egyptian type. The Fourth and 
Fifth Dynasties series from Lower Egypt is distinguished from the royal retainers 
of the First and Second Dynasties by having a much greater basio-bregmatic 
height. 


100 100 : 100 100 ; os sane Ds 
Ba’ | BL | © | nyt \|NBjNH’| 2 | @H |NH’| J | Ne 
Ist Dynasty 
Ist Dynasty | Royal Tombs 20°84} 3°26! 5°35!10°98) 6°51 5°01 | 14°64 | 9°60 | 0-95 | 10°54 
Motley 
Ist and 2nd | 
Private Dynasties 17°56 | 10°80 | 11°22 | 4°24] 2°92 12°38! 1°76 | 0°36 | 4°68) 0°52 
Tombs Royal Tombs | 
ith and 5th \ | 13-49 | 93-02 | 32-18) 2°50) 5:80 | 0-51 | 0-321 075/674] 2-69 
Dynasties | 
Ist and 2nd 
Ist Dynasty Dynasties 0°01; 2°60; 1°51) O°78| 0°31 0°21 4°41 | 4°82 | 0°02| 4°68 
Royal Tombs | 
Royal Tombs} 4th and 5th ) - 
. . 6 7D 9-29 3+ 0: 9°57 92 | 6:28 | 3 52 
Motley Dynasties 1°61 ri 7 1] 57 19 ) 3°41 ) 
Ist and 2nd - 
Dynasties ith and 5th 1 1.48} 9-60! 1-98, 066] 0-08 |5:08| 0:46/0-01 3-00] 0-70 


peta Jynasties 
Royal lombs| I ynastie } 


The Coefficients between these Early Dynastic and the four Pre-dynastic series 
previously considered are given in Table VII. 
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TABLE VII. 


Coefficients of Racial Likeness between Early Dynastic and Pre-dynastic Series 








| | Ist Dynasty | Ist Dynasty | 1st and 2nd | 4th anda 5tl | 
Private Royal Tombs | _ Dynasties | ~aes Pm 
Tombs Motley | Royal Tombs | ‘aoa, - 
| | | (83-6) (33-2) |" (1'8) (39-9) 
| Pea | a San hues nad 
Pasir Pro. | 
| oe 14 Characters we) 2°304°13 | 1215413 | 7644-13 | 11-864°13 | 
peer 6 Indices and Angles O11+°19 | 12°174°19 | 8°62+°19 13°36+°19 | 
| ; 2 . | 
| N 1 | ie te | precery 
A acd g | 14 Characters | 1924°13 | 968+-13*| 5°98+°13 | 10°79+°13 | 
“ (661). 6 Indices and Angles | 0°46+°19 | 8°60+°19 7°81+:19 12°45+°19 | 
an | 
= 
Late Pre- ‘ = ; | 
| eect 14 Characters vo| §=2°17E°13 9°82+°13 1°88 +°13 11°424+°13 
105°6 ; 6 Indices and Angles 1°84+°19 9°99 + °19 6°55+°19 13°75+°19 
LEE: Ti, SPR aires 2s Ree 
| ~ T and p’ | 14 Characters wa. | 2454-13 | 2°59+4-138*} 0564-13 1:27+°13 
| agg. | 6 Indices and Angles | 2°784°19 | 0224-19 | 0-374-19 0°33+-19 | 


* The Coefficients between Motley’s First Dynasty series and the two Naqada can also be calculated 
for the whole 31 characters, of which 12 are indices and angles; with the A and Q they are 6°36 +-09 
and 6°65+-14 and with the B, 7’ and R 237+: 


09 and 0-90-14 respectively for all characters and for 
indices and angles only. 


The Coefficients in Table VII show that there is a distinct line of cleavage 
dividing the series inte two opposed groups. The Private Tombs series of the 
First Dynasty belongs clearly to the Pre-dynastic stock and it is apparently most 
closely related to the Late Pre-dynastic type which is represented by more skulls 
than the other two. And those four series are most clearly differentiated from the 
other three arranged horizontally in the table; the latter form another group of 
closely inter-related types. For convenience we may term these groups the Upper— 
the Pre-dynastic—and Lower Egyptian types respectively ; the appropriateness of 
the titles will be more apparent when later series are considered. The Naqada A 
and @ group behaves in every way like Thomson and Maclver's Pre-dynastic types 
but the distinctiveness of the B, T and R skulls is again brought out in Table VII. 
The series being a short one its Coefficients are for the most part of a lower order 
than the others in the table but it clearly inclines more to the Lower than to the 
Upper Egyptian type and may be considered intermediate between the two. The 
division between the two main groups is quite sharp, but there is very definite 
evidence to show that the transformation from one to the other had started in 
prehistoric times. A comparison of all the Coefficients shows that the Early Pre- 
dynastic is the series most widely removed from the Lower Egyptian type and the 
Naqada A and Q group is more distant than the Late Pre-dynastic. A similar grada- 
tion, providing the principal justification for the arrangement shown in Fig. 1, p. 2, 
is found among the Lower Egyptian series. The relationships are brought out more 
clearly by comparing individual characters. Whether comparison is made between 
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series within either group or between series of different groups, the same characters 
always show the most significant differences and the same are constant throughout 
so long as Motley’s First Dynasty series is not concerned in the comparison. From 5 





the latter, greatest divergences—i.e. the highest values of a—are in every case for 


the characters G’H and NH’, but otherwise it behaves like a pure Egyptian series *. 


. . . . 
In general the relationships between these series are such that: 
(i) L, LB, NB and AZ show no significant differences whatever, 
(i) 100 H’/L, G’H and NH’ are practically constant 
(iii) NZ, H’ and 100 NB/NH’ have small but orderly differences, and 
(iv) the greatest differences are found always for B, 100 B/H’, 100 B/Z and to 
a less marked extent for J. 
The measurements showing most significant differences are compared below : 
oS ob 
= a Bes a 
| 100 | 100 » | 100 lw 
B | Bt | BH’ | 7 H’ | wpjnn’| N 4 
a a | 
Karly Pre dy ( 131°4| 71°7 98°1 | 124°3 34-0 50°3 | 67°°0 | 
| nastic \ | 
: | 
; | Naqada Aand) | 139.7] 71-8 | 99-2| 1295-9 | 133-8] 50-0 | 65-5 
( pper @ ( | | | 
Wed in} . nad v- | | | 
Egyptian | Late Pre-dy-( | 193.4] 72-1 | 100-5 | 127-0 | 132-7] 51-0 | 66°-8 
l'ype nastic if 
Ist Dynasty l | 
Private | 133°7 | 72°7 | 99°0| 130°9¢ | 135'1] 50-4 | 66°°2 
Tombs j | | 
| 
| 4 
BY, ‘ ‘ > yy" ' 
Naqada B, 7 | 136-1 | 74-0 | 103-0 | 126-2 | 132-2| 49-9 | 643 
and i ) 
| | 
| Pg | | — 
Ist Dynasty ) } | 
Royal Tombs >| 136°2 | 73°8 103°6 | 129°7 132°] 47 °9 63° +4 | 
| (Motley ) | | | | 
| pe Ist and 2nd) | } | | ' 
aaah Dynasties >| 137°7 | 74°9 | 103°7 | 129°9 | 132°8} 48°5 | 65°°S 
Egyptian ¢ src (| } 
Type Royal Tombs ) | 
. Dyn: | - 
th Dynasty! | 137 | 76-2 | 102-9 | 131 137 1671 
Broca \ | 
4th and 5th) | = oe 
d 1 139°3 | 75: 02-5 Ore 26°0 82 . ( 
Dynasties  § | 189 P 4 | 102°5 | L27°1t | 136 | 64 < 
+ These bizygomatic breadths are based on fewer crania than the corresponding calvarial breadths. 
In the case of the First Dynasty series B is the mean of 41 skulls and J of 25. The mean B for those 
25 is 134-8 so the J for the 41 would almost certainly have been greater than the recorded value and 
more in accordance with those of the allied series. In the same way it is probable that the Fourth and 
Fifth Dynasties J should be greater than 127-1. 
The most important differences between the Upper and Lower gyptian types 
dd 7 } 


are those associated with the breadth of the calvaria—B, 100 B/Z, and 100 B/H’: 
the latter is the broader and less dolichocephalic skull. Those differences are 
correlated with lesser differences between the zygomatic breadths. The Lower 
Egyptian type also has the lesser nasal index and nasal angle. The basio-bregmatic 

* We are only considering here the 14 characters measured by Thomson and Maclver which can be 
used in calculating the Coefficients of Racial Likeness. There are other characters which distinguish 
Motley’s First Dynasty series from all purer Egyptian types (see below pp. 42 and 43). 
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heights are less regular. The values of H’ for the Fourth Dynasty series are quite 
significantly greater than those for the First Dynasty Royal Tombs skulls and it 
will be shown later that the smaller values are those of the pure type while the 
others can only be accounted for on the supposition that the skulls represent 
a slight admixture with a race foreign to Egypt. So we may conclude that the 
Upper Egyptian type has a greater basio-bregmatic height than the Lower. The 
Fourth Dynasty series from Sakkarah, of which measurements are given above and 
in Column 6 of Table V, was measured by Broca. The measurements are in his 
unpublished note-books but the means have been quoted by several French cranio- 
logists*. Unfortunately they are given in whole mms. only and perhaps not to the 
nearest mm., while it is not known how many of the 31 skulls each mean is based on. 
The means are very similar to those of the contemporary series from Deshasheh 
and Medum measured by Thomson and Maclver-+. 

It will be convenient to consider first two later series from Lower Egypt. Ther 
are no others from that region: the majority of the ancient Egyptian skulls that 
have been measured came from the Thebaid. 

5. Series from Lower Egypt later than the Fifth Dynasty. A long series of 
crania belonging to the period from the 26th to 30th Dynasties and containing 
upwards of 1700 individuals has been measured in the Biometric Laboratory by 
Miss A. Davin and others, and a paper on this series has recently been published j. 
The object of that paper was to determine authoritative values for the correlation of 
cranial characters. This object can only be obtained by the measurement of a long 
series. The present use of that series is only a by-product of that enquiry, which 
has itself no direct bearing on the present paper. The standard deviations found 
for the F series have, however, been used in this and in other recent craniological 
work as constants for Coefficients of Racial Likeness. This is done because, being 
deduced from large numbers of crania, the values are more likely to express the 
general variation of the human skull within a fairly homogeneous group than 
standard deviations deduced from the small series which are all it is feasible to 
obtain as samples in the case of many racial groups. The mean measurements of 
the 26th to 30th Dynasties skulls are given in Table XIV. The specimens came 

The means given in Table V were taken from (16), p. 507 and (15), p. 57. 

| In the winter of 1924—5 the British School excavated sites at Qau, 30 miles south of Assiut, 
of the Fourth to the Ninth Dynasties. The human remains discovered are of particular interest because 
there are no records of others from Middle Egypt. Through the courtesy of Sir Flinders Petrie I am able 
to give here the means of his field measurements of some of the ¢ skulls. The series has been presented 
to the Biometric Laboratory and a full report on it will appear later. The means are: L=184-7 (31), 
B = 139°6 (31), H’ = 136-9 (30), J=127-7 (28), Dé 24-6 (30), GIL =95°5 (29), LB = 101-2 (30), 
GH = 710°3 (29), NH’? =51-2 (29), NB=25-4 (29), 100B/L = {756 (81)}, 100 H’/L = {741 (30)}, 
100 B/H’ = {102-0 (30)}, LOONB/NH’ =49°6 (29), N 2 = {64°35 (29)}, A Z 3°8 (29)}, BL = {41°7 (29)}. 
If these measurements are compared with the Fourth and Fifth Dynasties series from Deshasheh and 


14 
Medum provided by Thomson and Maclver (Table V above) the two will be seen to be remarkably 
similar. Fourteen characters can be used in calculating the Coefficient of Racial Likeness between 
them, giving C.L. = - 0°50 +-+13 and C.L.= - 0°46 +-:19 for 6 indices and angles. So, as far as we can 
tell, the populations were absolutely identical and they only differ from the pure Lower Egyptian type 
by having a greater basio-bregmatie height. 

; $ standard devia 
tions had previously been published in Biometrika, Vol. x1v. p. 208 and the Q in Vol. xvi. p. 14. 


Biometrika, Vol. xv1. pp. 237—239. The series is known as the F series and the 








| 


E Series, 26th to 30th Dynasties | 185°3 (895) | 138-9 (896) | 13 
4th Dynasty, Sakkarah (Broca) ... | 185 31)}141 ( 31)]13 
Ptolemaic Period, Sakkarah(Broca) | 181 13) | 139 13) | 13 


E Series, 26th to 30th Dynasties 3 3 
tth Dynasty, Sakkarah (Broca) ...| 38 ( 31)) 3: 
Ptolemaic Period, Sakkarah ( Broca 3 3 
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from a single cemetery south of Gizeh which was excavated by Sir Flinders Petrie 

(see (5), p. 29). In Table VIII the measurements of a short series from Sakkarah 

dating from Ptolemaic times are compared with two other series from Lower 

Egypt—the 26th to 30th Dynasties from Gizeh and Broca’s Fourth Dynasty 

series from Sakkarah. The individual measurements of the Ptolemaic skulls are in 

Broca’s unpublished note-books and the means are given by Chantre ((15), p. 131) 

and other French craniologists. They are presented in as unsatisfactory a way as 
the rest of that material (see footnote |! to Table I, p. 15 above). 

TABLE VIII. 
Mean Measurements of Lower Egyptian Series. Males*. 
L B =" is J DC 


| 

ne ' : } = 
} 
| 


32°4 (884) | 94°8 (935) | 128-7 (785) —~ 
7 


( 31)| 95 31 


/ 


Oy O» NH’ NB fml fmb 


2 

2 

3 31)| 52 31) 24 31 36 (31 30 
3 13 50 13 24 ( 13 35 13 29 


|} 100 B/L 100 H’'/L | 100 B/H’ | 100 O./0,' —_ — 


A full account of the measurements of the 26th to 30th Dynasties series is given in the footnote 
to Table XIV. The indices of Broca’s two series in Table VIII were all calculated from the mean 
values of the component lengths. 


The following Coefficients of Racial Likeness were found: 


TABLE IX. 
Coefficients of Racial Likeness with the E Series 
(26th to 30th Dynasties) from Gizeh. 





26th to 30th Dynasties (885°4) 


All Characters Indices and Angles 
C.1 Number of C.1 Number of 
7. | Characters | = Characters 
| 
Ist Dynasty Private Tombs (33°6) oes | 16°044°13 | 14 24°79+°19 6 
ist and 2nd Dynasties Royal Tombs (21°8 1:20+4°13 14 1°85+ "19 6 
| Ist Dynasty Royal Tombs (Motley) (33-2 9°75 + 09 | 31 4°25+°14 12 
4th Dynasty Broca) (31°0)... ay 3°89 +°13 | 13 3°62 + °24 | | 
|} 4th and 5th Dynasties (39°9 aes | 3'43+°13 | 14 3°44+°19 | 6 


131 31 24 (31 
36 13)|97 ( 18) | 126 13) 23 (13 





888 51°5 (898 24°4 (893), 35°6 (894), 30°0 (905) 


31 


2 


13 


NB/NH’ Smbf[fml 


E Series, 26th to 30th Dynasties | 75°1(884)| 71°5(884)|104-9(884)  87-8(886)| 47-4(893)! 84:5 (889 
1th Dynasty, Sakkarah (Broca)... | 76°2( 31)| 74°1( 31)|102°9( 31 86°8 ( 31 16°1( 31 83°3( 31 
Ptolemaic Period, Sakkarah(Broca 76°8( 13 75°1 ( 13)|102°2( 13)| 84°6( 13 48°0( 13); 82°9( 13) 
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The 26th to 30th Dynasties series contains such a large number of skulls that 
Coefficients between it and closely allied types may be as large as those which 
would indicate distinct racial differences when shorter series were being compared. 
All the Coefficients with the Upper Egyptian types (not given in Table IX) are of 
the same order as those with the First Dynasty Private Tombs series above. The 
26th to 30th Dynasties skulls are of the Lower Egyptian type and they show 
strong affinity with all the representatives of that race except Motley’s aberrant 
series. From the First to the 30th Dynasties the type had hardly changed at all and 
the latest series shows no progression on that of the Fourth and Fifth Dynasties. 
A comparison of individual characters confirms that conclusion. The values of the 
distinctive characters 100 B/H’, 100 NB/NB' and N Z as well as its cephalic index 
and calvarial breadth all relegate the 26th to 30th Dynasties series to the Lower 
Egyptian group and there is no suggestion of there having been any admixture 
whatever between the more stationary population of the Fayum and the Upper 
Egyptian race. If the EH series is intercalated between the First and Fourth 
Dynasties series we get a more or less orderly sequence for the characters B and 
100 B/Z but the increase in calvarial breadth is apparently not associated with an 
increase in zygomatic breadth. 


100 100 100 | 

| | J ne 
| | ? | Bu H’ | pia’ | a’ | 
| aurea 
Ist Dynasty Royal Tombs Motley ... | 136°2 | 73°8 | 129°7 | 132°1 | 103°6 | 71% 
| lst and 2nd Dynasties Royal Tombs ... | 137°7 | 74°8 | 129°9 | 132°8 | 103°7| 
| 26th to 30th Dynasties... Ste .-- | 138°9 | 75:1 | 128°7 | 1382°4 | 104°9 | 77 
| 
| 4th Dynasty (Broca sie aoe oo | dae | 762 1131 137 102°9 | 741 
| 4th and 5th Dynasties ... sid ..- | 1389°3 | 75°4 | 127°1 | 186°0 | 102°5 | 72°6 

Ptolemaic Period (Broca ee ... | 139 76°8 | 126 136 102°2 | 75:1 


The basio-bregmatic height (H’') and the indices 100 B/H’ and 100 H’/Z 
affected by it are responsible for all the greatest values of a between the six series. 
There is a clear distinction for those characters between the upper and lower 
triplets. This is somewhat surprising but it cannot disturb. our belief in the very 
close affinities of all the types. There is clear evidence to show that the pure 
Upper Egyptian type is represented by the 26th to 30th Dynasties series*, and 
hence we must suppose that the other has been modified by some process of 
evolution or racial admixture. As far as we can tell that process only affected the 
calvarial height so that the only significant differences between the Fourth and 
Fifth Dynasties series and the 26th to 30th are for H’, 100 B/H’, 100 H’/Z and NB. 
the last difference being small. We may note that the skulls of the First Dynasty 
series, which represent the Lower Egyptian type in Upper Egypt, are not absolutely 
pure, but they incline slightly towards the Upper Egyptian type, as shown in 
Fig. 1. That arrangement is justified by a comparison of all the Coefficients. 

The transformed Upper Egyptian types incline towards the 26th to 30th Dynasties series for all 
characters. 
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6. Upper Egyptian Series later than the Fifth Dynasty measured by Thomson 
and MacIver. There are six of these series and the mean measurements, which 
have not previously been published, are provided in Table XI. The number of 
skulls in each is sufficiently large to give a reliable determination of type. From 
a cursory inspection of the figures it is clear that three of the series resemble one 
another closely and are very similar to the First Dynasty Private Tombs skulls, 
The Coefficients of Racial Likeness are given in Table X. 

TABLE X. 
Coefficients of Racial Likeness between Series of Early and Middle Dynastic Crania. 











Paying due regard to the differing numbers of skulls in the series, we may con- 
clude from the Coefficients that they are arranged by their relationships in a 
sequence which is identically the same as the chronological order. This suggests 
that the skull form was being gradually modified in a particular direction though 
the change from the First to the 18th Dynasties was quite slight. Significant 
differences are only shown for the characters 100 NB/NH’', J, H’, NB, G’H and 
100 H’/L. The last three of these are characters which are constant for pure 
Egyptian types, but no real significance can be attached to their small and erratic 
differences in the case of the four Early Dynastic series. There is no gradation in 
calvarial breadths or cephalic indices, but 100 B/H’, 100 N/H’, H’ and N Z all give 
orders which are almost precisely the same as the one determined from the 
Coetficients and the series are obviously links in the transition from the Upper 
to the pure Lower Egyptian type of the 26th to 30th Dynasties series. 


| 100 B/H’ 


| 
H’ 100 NB/NH’ | 


Ist Dynasty Private Tombs ... 99°0 135°1 04 66° °2 

| 6th to 12th Dynasties 99°6 134°8 04 65°°7 
12th to 15th Dynasties 100°6 133-0 18°8 65°°7 
I8th Dynasty (Shekh Ali 1011 132°4 18°O 65°°6 
26th to 30th Dynasties 104°9 13274 17 "4 63°°9 | 


Part of the difference between the last series and the others for this character may be 
differences in the determination of the alveolar point. See p. 53 below 


due to 


| eee 6th to 12th | 12th to 15th | 18th Dynasty 
|. Wiles Dynasties Dynasties Shekh Ali 
| (33°6) (168-6) (65°9) (40-0) 
| ——_—__—_—_}|__ — 
| | | | 
ist Dynasty Pa 14 Characters a 288413 1:88+°13| 3-42+°-13 
Private Tombs (33°6)... | 6 Indices and Angles — |—O'13+°19| 0O°87+°19 1°58+°19 
» | 
| 6th to 12th Dynasties | 14 Characters = 2°88 +°13 — 2°54+°13 2°87 +°13 
168°6) 6 Indices and Angles 0°13 +°19 — 2°84+°19 3°21+°19 
| 12th to 15th Dynasties | 14 Characters a 1°88+°13| 2°54+°13 — 0°14+°13 
65°9 6 Indices and Angles | 0°87+°19 2°84+°19 0°25+°19 
18th Dynasty Shekh Ali | 14 Characters ~ 3°424+°13 2°87+°13) 0°144+°13 
10°0 6 Indices and Angle: 1°58+°19 3°21+°19 |-—0°25+°19 — 
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TABLE XI. 


Upper Egyptian Series later than the 5th Dynasty measured by 





Thomson and Maclver. 





Males* 











* All the indices 


lengths. 


Coefficients 








and angles 


in table 


TABLE 


were 





calculated from th 


XII. 


e mean values of the 











: : , ‘ ee o . Ptolemaic Roman 
Dynasty 6th—12th | 12th—15th 18th | 18th See | Period 
Locality Denderah | HOU8"d | gnexh ati | Abydos | Denderah | Denderal 

5 Abydos Kc | i f§ € € enderan 
= | 

F 181-4 (175) | 181-3 (67) | | 179-3 (50) 

E, 182°5 (175) | 183-0 (67) | 181-4 (50) 

3 134°3 (173) | 133°8 (66 136°4 (50 

H’ 134°8 (171) | 133-0 (66) 130°7 (50 

LB 100°8 (167 100°6 (66 99°9 (50 

GH 69°9 (169 71°0 (66) 70°5 (50 

J 124°5 (157) | 126-4 (61) 27°5 (46) 

NH' 51°2 (168) | 51:2 (67 51°6 (48) 

NB 25°8 (167) 25°O (67) 25°5 (48) 

GL 96°5 (166) | 96-7 (66) 94-2 (50) 
100 BL 73°6 (173) | 73°1 (66) 75°2 (50) 
100 HL 73°9 (171) | 72°7 (66) 72°1 (50) 
100 B/H’ =| 99°6 (171 L00°6 (66 104°4 (50) 

100 (B- H1')/L | 0°3 (171) 0°4 (66 3°1 (50) 
100 VB NH 50°4 (167) | 48°8 (67) 19°4 (48) 

VL 65°°7 (166) | 65°°7 (66 64°°3 (50) 

Ld 72°°8 (166) 72°°0 (66) 73°°2 (50) 

re 11°-5 (166) | 42°-3 (66) 42°*5 (50) 





component 


of Racial Likeness between series of the Lower Egyptian Type. 



































' | 2 | 
18th i Ptolemaic | Roman ist and 2nd {th and 5th | 26thto 30th 
Dynasty Sate | D> Dynasties } 
1 Tae Period Period <a Dynasties | "ea 
Abydos 76-6 | (49-3 | Royal Tombs 39-9 | (gas. 4) 
(49-9) (766) | 49°38) | “(ars) (39°9) | | 
| 
18th Dynasty | 14 Characters se = O89+° | 24°13) W1G#H'13 | O87+°13 | 5884-13 
Abydos (49°9 6 Indices and Angles O96 4°19) 1154-19] O51L+°19 | 0°61 = 9 | 540+°19 
y Cae : ee ea 
Ptolemaic 14 Characters os O89 +°1: ot }O°724°13| 0°644°13 ‘264°13 | 1005+" 
Period (76°6 6 Indices and Angles | —0°56+-"1! | 1°314+°19| 0°93+°19 riod 19 | 868+: “= 
Roman Period | 14 Characters 2°92 +1 O'72+°13 -- 0°65 +°13 3°55 +13 5°35 4°13 
19°3 6 Indices and Ang les L15+ 19 ila - O'10+°19 | O'48+°19 | 2°194+°19 
Between the short Naqada B, 7 and R Series (see above, p. 18) and Thomson and Maclver’s 
Ptolemaic we find coefficients of 0-04+-+13 for 14 characters and —0°88+--19 for 6 indices and angles. 


It is possible that the skulls that make up the former series represented a population of almost pure 
Lower Egyptian type which was intrusive in Upper Egypt in early times and had not mixed with the 


native 


population there. 


That 


the 


two Naqada groups were 


shown by the similarity of their grave furniture, 


Biometrika xv11 


almost certainly contemporaneous is 
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A comparison of the Coefficients of Racial Likeness with the 26th to 30th 
Dynasties series confirms the conclusion already arrived at that as the dynasties 
advanced the population of Upper Egypt approximated more and more to the 
stable type of the Lower Egyptian people. 

The series of which the means are shown in the last three columns of Table XI 
appear to be of almost pure Lower Egyptian type although they came from Upper 
Egypt. The Coefficients between them and with three other series to which they 
are closely related are given in Table XII. The Ptolemaic and 18th Dynasty Abydos 
series are almost identical, while the Roman is very similar to the former but 
distinct from the latter. All three are closer to the First and Second Dynasties 
Royal Tombs series, than to any other of the Lower Egyptian type and the skulls 
of the 18th Dynasty from Abydos are the only ones that show very close affinity 
with those of the Fourth and Fifth Dynasties from Deshasheh and Medum in 
Lower Egypt. 

The Fourth and Fifth Dynasties series was shown to be of absolutely pure Lower 
Egyptian type for all characters except the basio-bregmatic height and the index 
100 b/H’. The 18th Dynasty Abydos series, though still inclining slightly towards 
the Upper Egyptian stock, is also primarily of the Lower Egyptian type and it too 
is characterised by a large H’. We can only suppose that there was a very slight 
alien element in both those populations. We may intercalate them between the 
other series to give the more or less orderly sequences shown below for all characters 
except H’ and B/H’, but the arrangement in Fig. 1, p. 2, more accurately represents 
their relationships. 


100 100 100 H’ 
B/L | B/H’ | NB/NH'| 


Ptolemaic Period 


hes 135°3 | 73°7 | 102°9 | 49°3 | 131°5 
18th Dynasty Abydos oes | eae we 73°9 102°3 148°8 | 134°2 
Roman Period ... nine nae wee 136°4 75°2 104°4 19°4 130°7 
Ist and 2nd Dynasties Royal Tombs ... i37°7 74°9 103°7 18°5 132°8 
26th to 30th Dynasties ... 138°9 75°1 104°9 17 °4 132°4 
ith and 5th Dynasties ... 139°3 | 75°4 102°5 18°2 136°0 








There is a considerable difference between the 18th Dynasty skulls from Abydos 
and the contemporary series from Shekh Ali, the Coefficients between the two 
being 4°87 +°13 for 14 characters and 0°73 +°19 for six indices and angles. The 
Fourth and Fifth Dynasties and the 18th Dynasty Abydos series are the only two 
measured by Thomson and MaclIver which do not represent intermediate types 
between the two races which populated Upper and Lower Egypt in the earliest 
times of which we have record, without the slightest infusion, as far as we can tell, 
of racial elements foreign to Egypt. 


’ 
1 


be Upper Egyptian Series later than the Fifth Dynasty measured by Chantre 
and Broca*, This material has all the defects of Chantre’s prehistoric series (s 

* We are unable to make any use of two papers by Pruner-Bey (17) and Biasutti (18) that are 
frequently referred to. The total number of skulls dealt with in each is considerable, but when divided 
into geographicai and time groups they become quite inadequate. 
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TABLE XIII. 


Mean Measurements of Upper Egyptian Series later than the 
5th Dynasty measured by Chantre and Broca*. 

















Period llth Dynasty 18th Dynasty | 18th Dynasty 26th Dynasty Ptolemaic 
|-— ——- | oe" 
| Locality | sboa't Negesh) Gournah | Thebes Gournah Gournah 
Measured by Broca Chantre | Broca Chantre Chantre | 
| Number of Skulls 18 20 | 18 20 25 | 
|\—_—______—_- = —_—|—————— CWI ———— 2 aaa 
| | 
| L 184 184 182 2 180 | 
| B 36 135 | 136 13¢ 137 
| i” 129 131 133 -—— 
3” 93 95 91 97 94 
J 130 129 126 130 127 
| DC 29 25 26 23 23 | 
| 0! 37 38 38 40 39 
0s 33 33 31 3: 32 
VH’ 5] 52 1) 52 49 | 
NB 24 26 25 26 25 
ful 35 3 34 3 3 | 
fimb 29 29 29 9 30 
U one ane | 510 — 
100 B/L 73°9 73°4 | 74°7 747 7671 | 
100 H’/L 70°71 71:2 73°71 — 
100 B/H’ 105°4 103°1 102°3 — — 
100 (B-A')/L 3°8 22 «| 1°6 
100 0/0,’ 89-2 86-8 81°6 82°5 81°] 
100 VB/ NH’ 17°] 0°70 10 50°0 51:0 
100 finb/fml | 82-9 853 85°3 85°3 88-2 | 
| 


All the indices in this Table were calculated from the mean values of the component lengths. 


footnote || to Table I, p- 14). [onoring the shorter series, there are five available 
ranging from the 11th Dynasty to Ptolemaic times. The mean measurements given 
in Table XIII were taken from various tables in (15) and (16), p. 507. All the series 
appear to be intermediate between the Upper and Lower Egyptian types: the 
breadths and cephalic indices arrange them in orders which are almost identically 
the same as the chronological order. As far as we can tell all the series are pure 
Egyptian and their relationships seem to confirm the hypotheses that were deduced 
from more adequate data. 


J 


8. Upper Egyptian Series later than the Fifth Dynasty measured by German 
Craniometricians. These are of very considerable importance because the majority 
are long series and an adequate number of measurements is provided for all of 
them. The individual measurements of two of the most important have been 
furnished by Dr Emil Schmidt in the Leipzig portion of the German Anthro- 
pological Catalogue(19). The longer (loc. cit. pp. 57—117) was reduced by 
Professor Karl Pearson and Dr Alice Lee and the means were first quoted in 
Faweett’s paper ((6), p. 426). The skulls came from Thebes and belonged to the 


3—2 





Date 
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| 


} 


Locality 


| 


Measured by 


100 B/L 
100 H/L 
100 B/L 
100 B/H 


100 H’/L 
LOO B/H’ 
B-H' 
100 GH/GB 
NH 
NH' 


100 


100 VB 
100 NB 
100 0,/0 


100 0, 0; 
100 fmb/finl 
LOO G,/G4/ 


Pé 


ems. 


LOW 


L 


Middle 
Dynasties 


|E 





North 


| Toldt 


1355°O (27) 
182°2 (37 
181°8 (37) 


134°2 (37) 
90°4 


Q> 








37) 
135°9 (35 
116°1 (36 
101°6 (34) 
372°7 (33 
126°0 (36 
130°9 (33 
115-1 (30 
97°3 (30 
505°6 (35 
70°1 (29 
o4- 9 ( 29 
26 
49°8 (37 
24°8 (36 
23°4 (34 
10°8 (36 
38°1 (35) 
32°3 (35 
16°4 (25 
96°5 
34°9 
29°2 
73°7 
72." Qr 
ind vi 
74°7 (35 
98°7 (35 
O°9 (35 
73°9 (29)! 
19°9 (36 
179°2 (35 
84°9 (35 
84°0 (3] 
85°°6 28 





0 we Ed 


| 








—_———_| 


Marly and 
Middle 
Dynasties 


South 


Toldt 






aninientinbishetens 
1366°1 (67 
182°7 (70 
182°5 (66 
135°9 (70 
92°7 (70 
132°9 (65 
115°0 (66 
100°1 (63 
371°8 (69 
128°2 (69 
127°8 (67 
116-0 (66) 
97°8 (66 
106°2 (69 
70°2 (62 


19°4 (65 
25°O (63 
23°3 (59 
£1°3 (66 
38°4 (64 
32°8 (66 
17°O (55 
98°1 (55 
34°7 (64 
28°9 (64 
74°5 (66 
74°4 (70 
72°9 (65 
1102°3 (65 
{16 (65 
{73°7 (57 
10'S (63 
{79-4 (66 
85°7 (64 
83°4 (63 
84°°9 (58 
167°°5 (55 
{71°°O (55 
141°°5 (55 
127 6 (55 
113°°9 (55 
{62-0 (66 


1-Kubanieh E!-Kubanieh 








Mean Measurements of Middle and late 


TABLE 





XIV. 








910 
Mode 18th to 21st 
| Dynasties 
Abyssinian 


Tigre Dist. 


———$— 


S. Sergi 





Thebes 





Schmidt 


| 1387°6 


183 695 181°9 
136°3 (65 136°6 
94°6 (65 93°8 
134°] 
133°5 (64 | 131°9 
112°6 (65) } 114°3 
99°8 (64) | 100°6 
309°8 (65) | 306-1 
372°7 (58 372°4 
130°2 (65 — 
128°9 (65 
113°7 (57 _— 
95°8 (57) 96°5 
9130 (65 10°8 
69°1 (65) 68°8 
93°1 (57 95°7 
125-0 2) | 128°3 
| 50°61 
18°9 (64 [50-4 
25:0 (63 25°26 
22°0 (65) — 
9°53 (64 os 
40°0 65) _ 
[37°3 (65 } 38°14 
33°2 (65 32°96 
16°25 
— 10°83 
95°2 (64 96°41 
36°1 57 35°10 
IQ 9 29°80 
a {737 
(74:2 (65)}| 75-07 
aa f101°9 
{72°7 (64)}| {72° 
f102°1 64)! | {103-6 
{15 (64)} {2°6 
{74°2 57)? 7 2 
50°07 
{511 (63)} | {50-1 
{83:0 (65)! — 
189°O (65)! 86°5 
{81°7 57 ; 85°50 
= 88°5 
85°°5 
(65°°4 (64))| {66°-] 
{73°°2 (64)!| {73°-0 
(414 (64)! | {40°-9 
{28° °4 
—_ | 712° 
161°9 (57)} 
605°2 (65 


—$ 





Dynasties 


Thebes 


Stahr 


164) 1451°0 (50 
(169) | 183°5 (: 57 
| 
169) | 137°3 (56 
(168 94°9 (59 
169) —_ 
168 133°7 (54 
(169 114°1 (56 
169) | 101°9 (56 
169 — 
168) | 372+] 1) 
125°7 (54 
129°8 (50) 
| 114°7 (54 
166 96°1 (54) 
168 12°6 (54 
169 70°1 (56 
16] 95°5 (54 
166) | 128°6 (53 
169 — 
169)]| 50°8 (56 
166 25°6 (53 
| 21°1 (52 
10°8 (52 
169 38°3 (59 
169 32°8 (53 
160 16°2 (43 
151 
167) | 9671 (53 
168 35°2 (55 
165 29°74 (54 
169)! 
169 74°8 (56 
169)} 
168)! | {72-9 (54 
168)! |'102°7 (54 
168)! 12°0 (54 
5! {73°4 (54 
166 ao 
166)? 190°4 (53 
169 185°6 (53 
165 184°4 (54 
151 —— 
17 85°°2 (55 
167)! | {64°°5 (5% 
167)! | {74°°O (53 
167)} | {41°°5 /53 
47) | {30°°3 (53 
17)} | {11°°2 (53 
161°4 (54 
598°4 (48 


| 18th to 20th | 








18th 
Dynasty 


-_—_——— 


Thebes ? 





Schultz and 











Oetteking 
1393°8 (21 
185°1 (25 
137°6 (25 
95°1 (25 


134°3 (24 


133°9 





38°3 (18 
33°0 (24) | 
17°5 (23 | 
$1°6 (2] 
96-0 (25) | 
36°4 (25) | 
30°7 (25 
} 





2! ti] 
25)} | 
25)! 
23)} 
1 "9 (23 i 
(60°6 (25)! 





Dynastic Series of tn Crania. 
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18th 
and 19th 
Dynasties 


Abydos 











Males. 


26th to 30th 
Dynasties 


Gizeh 





Davin and 





| 








Schmidt Pearson 
Se ee ie 
1413°7 (25) |1438°9 (753 
183°1 (33) | 185-3 (895 
: | 184-6 (879) | 
139°2 (33) 38°9 (896) | 
96°2 (33 94°8 (935) | 
132°4 (28) | 134-1 (884 
130-9 (33 [132-4 (884)]| 
113°5 (33 113°4 (887 
100°7 (3 101°6 (896 | 
a 1 (32 307°2 (878) | 
3745 (33) 371-9 (884) | 
—_ 127°4 (928 
— 128-5 (908) | 
115-2 (889 
96°3 (31 | 94°7 (885 
517-1 (33) | 518-7 (876 
71°3 (30) | 70-4 (g45) | 
95°9 (25) | 95-2 877 
131°3 (33) | 128-7 (785 
52°1 (31 51°65 (898 
[51-9 (31)]} [51-5 (898 
24°9 (26) 24°36 (893) | 
24°2 (31 — 
— 11°13 (886 
38°7 (30) | [88-4 (886)]| 
33°5 (31) | 33°73 (ggg) | 
46°2 (26 16°28 (82) 
t1°1 (21 38°81 (703) | 
96°2 (31) 95°03 (832) | 
35°9 (31 35°56 (894 
30°2 (30 29°96 (905 | 
as 75°34 (866 
73°7 (33) | 72°46 (871 | 
76°0 (33) | 75°06 (884 
103°1 (33) | 103-70 (873 
71°5 (33 | {71°5 (884) 
106°4 (33) | {104-9 884)} 
(4°5 (33)}| {3-5 (884)?! 
(74°4 (25)'| 74°13 (893) | 
17°9 (30 | 17°31 (881) | 
(48°O (26)}| {47:4 (893)}] 
= | 82°12 (878 
86°8 (30 187°8 886)! 
84°3 (28 84°47 (889) | 
89°3 (20) | {83-9 (703): 
|} 85°°8 195 
{65°*2 (30)}] 63°°9 . (818) | 
72 3 30\\| 74°"1 (818 
(42°°5 (30)!! 41°9 (818) | 
— | 30°03 (795 
— 11°*89 (795 
= 61°54 (884 
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Notes to Table XIV. 

The indices and angles in curled brackets were calculated from the mean values of the component 
lengths. The lengths H’, 0,’ and NH’ in squared brackets were found from the similar measurements 
H, O,R and NH, RK on the assumption that the differences would be the same as for Motley’s very 
similar First Dynasty series (in Table V, p. 24, above). Those differences are very similar to values found 
for other ancient Egyptian series and, as far as we can tell, they are quite constant for all modern races 
of man. The capacities given by Stahr were probably determined by a method which would make them 
not comparable with the others. Tildesley (Biometrika, Vol. xm. p. 217) has shown that in taking 
auricular heights on the Ranke’s craniophor with the flute-topped plugs which are at present used in 
the Biometric Laboratory *, a correction should be made by subtracting 2°5 mm. from the scale readings, 
and that is confirmed by comparison of mean direct and type contour measurements. That correction 
has been made to Motley’s First Dynasty means in Table V. In the case of the series measured by 
Sergi and Toldt it is doubtful whether the nasal height measurement corresponds to our NH or NH’: 
the difference between the two is small. Schmidt took the length to the lowest point of the pyriform 
aperture, i.e, whichever is greater of NH, R and NH, L, so the mean would be very slightly greater than 
that of either NH, R or NH,L. Oetteking does not say on which side he took the measurement. The 
Occipital Index is a complex function of the are S; and the chord S;’, so it is not surprising that its 
mean value calculated from the indices of individual skulls should not be the same as the index found 
from the mean values of S; and S.’. For all races the former exceeds the latter and the differences are 
0°5 for the 26th to 30th Dynasties series, 1:0 for the Naqada A and Q series, 0-9 for the Naqada B, 7 
and R series and 1°1 for Motley’s First Dynasty series. It can be shown that the difference is not 
dependent on the number of skulls in the series so we may accept the mean of these four as a corrective 
factor for Egyptian types. The Occipital Indices in curled brackets in Table XIV were obtained by 
adding 0°9 to the indices deduced from the mean values of Ss; and S,’. 

In finding the means of the Abyssinian series, four juvenile skulls (Nos. 7, 13, 53 and 59) were 
excluded and one juvenile (No. 15) was excluded from Toldt’s El-Kubanieh South series. The GL length 
of Schmidt’s 18th to 21st Dynasties series was given as 95°79 (169); omitting one juvenile skull (No. 630) 
I find that it is 96°41 (167) and the alteration affects the angles of the fundamental triangle. 

Several mean measurements of the 26th to 30th Dynasties series other than those in Table XIV were 
determined: viz. ’=182°8 (895), Q@ =307-5 (876), O,; L=40°75 (880), O. L=33°83 (888) (the. O,, O, and 
100 0./O, in the table refer to the right orbit), G,;=49°70(817), 100 H/L’=72-72 (863), 100 02/0,, 
L =83-06 (876), 100 Gy/G = 78°36 (668). 





18th to 21st Dynasties. The means of the other series ((19), pp. 64—74), which 
belonged to the 18th and 19th Dynasties and came from Abydos, are now given 
for the first time. Schmidt gives quite complete measurements, following in general 
the Frankfurter Verstdndigung, but a few of them might have been defined with 
greater precision. The means are given in Table XIV. Both series are pre- 
dominantly of the Lower Egyptian type and they show very close affinities with 
some previously considered, 

When comparing the Coefficients of Racial Likeness calculated for several 
series of different lengths it is of the utmost importance to take into consideration 
the number of skulls on which each series is based. If that is done the five series 
of Lower Egyptian type shown in the upper part of Table XV are arranged by 
their inter-relations in the order shown. Schmidt’s Abydos type stands very close 
to those of the 26th to 30th Dynasties and of the Royal Tombs of the First and 
Second Dynasties, but his longer series from Thebes of the same date is far closer 
to the Roman Period skulls. The former is of quite pure Lower Egyptian type and 
the latter diverges slightly from that race in the direction of the Upper Egyptian. 
The latter point is brought out clearly by a comparison with the five types shown 
in the lower half of Table XV, from the 18th Dynasty Shekh Ali to the Early 

[* Earlier measures were taken with the pointed plugs for which the craniophor is scaled, and I 
believe there is not adequate ground for modifying their values as Tildesley proposed.—Eb. ] 








Schmidt (29-9) Schmidt (162-4) 
All Characters Indices and Angles All Characters | Indices and Angles 
C.1 Number of | Number of 0.1 Number of CI Number of 
Characters - Characters Par Characters vies Characters 
9G 2 asties | 
ae 30th Dynastic 2-01 +09 27 | 2804-16 9 20°55 + 09 28 24-57 +15 10 
(885° = + + | = 
- ‘eg De aime ; 9-49 +09 | 29 41415 10 
Schmidt (29° | 
ts 2 Yynasties ; | ie ol - : 
we” a p —" O81 +13 id 1324-19 6 374-13 id 027 +19 6 
vova ombs ys 
Roman Periods* (49°3) ... | 1°74+4°13 14 1-05+°19 6 1°23 +°13 14 O1L+:19 6 
: A aha 
18th to 2st Dynasties! | 9.494 09 9) 2414-15 10 me 
Schmidt (162-4 S| 
masty Royal Tombs) | | 
oe p Bam ul Tom | 2°77 +-09 29 214-15 10 TAS +-09 20 nO+-14 7 
MOTICY (H+ yA | | 
‘ § dynasties * 3 
i th ~Dynasties*) | 9694-13 4 | 2534-19 6 70413 M4 774-19 6 
8 masty Shekh Ali* = dericais . : a 
l — hekh Ali TAB +13 i4 | 7454-19 6 1-28 4-13) 14 38 4-19 g 
> 2 Yvnasties*) |... — | | ’ : | 
a Dynastic ()is734+13| 14 19-03 4-19 6 1844-13 14 19:16 4-19 6 
(LOS°6 | | 
— Tom 8-90 +°13 | 14 14-98 4°13 6 | 6-77+°13 14 9804-19 6 
(oe ) | | | 
Late Pre-dynastic* (88°5) | 13°66+°13 14 | 19°41 +°13 6 14:07 +°13 14 2-05 +°19 6 
Early Pre-dynastic*® (35-8) | 16°91 +:13 | 14 21°81+°19 6 13°67 +15 l4 19°55 +19 6 
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TABLE XV. 
Coefficients of Racial Likeness with Schmidt's Series. 


18th and 19th Dynasties Abydos, 





These series were measured by Thomson and Maclver. 


Pre-dynastic. They represent the earlier transitional stages from the pure Upper 
Egyptian type to the distinct one of Lower Egypt. The Coefficients with Schmidt's 
series are all of a higher order than the ones in the upper part of the table, but 
the order of arrangement is in both cases the same as that which was arrived at in 
considering the inter-relations of the earlier series themselves. It is quite reasonable 


to place the 6th to 12th Dynasties series before that of the Ist Dynasty Private 
Tombs since the disparity between the numbers of skulls in each is so great. 
Remembering that Schmidt’s 18th to 21st Dynasties series is far longer than his 
other, we may conclude that its relationships with the Upper Egyptian types, 
though remote, are distinctly closer than are those between the latter and Schmidt's 
Middle Dynastic series from Abydos. A detailed comparison of individual measure 
ments need hardly be made ; it would confirm the conclusions that have already 
been reached. The greatest values of a are almost invariably found between the 
characters 100 B/H’, B, 100 B/E and L, while H’, NZ and 100 NB/NH’ show 
differences that are smaller but still significant in some cases; and certain other 
characters are reasonably constant throughout. Nevertheless Schmidt’s Z and G’ H 


of the 18th to 21st Dynasties series are as small as for any other dynastic Egyptian 





18th to 21st Dynasties Thebes, 
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series and their values differ significantly from those of some closely related types. 
This alone would be very meagre evidence of alien admixture, but it will be shown 
later that there are better reasons for supposing that the series in question had 
diverged slightly from the pure Egyptian stock. 


In 1919 C. Toldt (20) furnished individual measurements of Egyptian skulls 


s that had been disinterred at El-Kubanich near Assuan. The site is 100 miles 
; from Thebes and.is more southerly than any other with which we are at present 

ro . e 

ters concerned. The whole material came from two graveyards and it was divided into 


the following groups (see (20), pp. 526, 635 and 636): 
ee ‘ ; (Karly Dynastic 62 skulls. 
El-Kubanich South {00.0% : Baste. 
( Middle Dynastic 52 skulls. 
nie 2 : ’ Middle Dynastic 31 skulls. 
El-Kubanich North : : gen 
{| Nomads 26 skulls. 
The nomads in the second group were supposed to be related to tribes of the 


northern Sudan and they apparently belonged to Middle Dynastie times. Un- 


fortunately it is not possible to separate them from the other skulls from the 
El-Kubanich North cemetery. The means of the two groups reduced from Toldt’s 
individual measurements are given in Table XIV. The Coefficients between them 


and the most closely related series are in Table AVE: 


TABLE XVI. 


Coefficients of Racial Likeness with Toldt’s Series from el Kubanieh ll 





6th to 12th | El-Kubanieh , 12th to 15th | El-Kubanieh| 18th Dynasty| 


Dynastic North Dynasties | South Shekh Ali 

(168°6) (33-1) (65-9) (63°5) | (40-0) 
el-Kubanieh | 14 Characters “ 590413 | 3°244°13 OF+°13 86+ -°13 
South (63°5 6 Indices and Angle 7°78+°19 | 6°33+4°19 P6lL+ IY rBO# YD 
El-Kubanieh | 14 Character . OSL +°13 141+ °13 3°24+°13 169+ °13 
North (3371 6 Indices and Angles Ol44+°19 2°GO4+°19 | 6B3+°19 GOT +19 


* All the Coeflicients in Table XVI are calculated for the 14 characters, of which 6 are indices and 
angles, available for all Thomson and Maclver’s series. Between the two El-Kubanieh series 28 


characters can be used giving a Coefficient of 1°60 + -09 and of 3°16 4-14 for the 11 indices and angles. 


It is clear that the El-Kubanieh North and South series are very similar to the 

Middle Dynastic people of Upper Egypt: they take up the positions shown in 

i Table XVI and Fig. 1. The former is the later in time and yet it has been less 
changed from the Upper Egyptian type of the First Dynasty than the other and 

the difference is doubtless to be attributed to the nomadic clement. In calculating 

the Coefficients in Table XVI the greatest values of a were found for the characters 

100 B/H’, B, 100 B/L, H’'", Nz,1007'/L and NH’. The prominence of the last 


two 1s rather exceptional, The El-Kubaniech North series, but not the other, has 
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an unusually large basio-bregmatic height and height index, but other characters 
are not distinctive from other Middle Dynastic types. The Coefficients between 
Toldt’s two series and Schmidt's Theban skulls of the 18th to 21st Dynasties are 
given in Table X VII. 
TABLE XVII. 
Coefficients of Racial Likeness between Schmidt's 18th to 21st Dynasties 
and Toldts El-Kubanieh Series. 


18th to 2lst Dynasties Schmidt (162-4) 








All Characters Indices and Angles 
CI | Number of C Number of 


| Characters Characters 








El-Kubanieh North (33:1 wee | 4°50 + *09 27 6G l4+°15 10 
El-Kubanieh South (63°5) ; 2°65 + ‘09* 27 3°27 + °15% LO 
18th Dynasty Shekh Ali (40°0) ... | 4°284°13 14 38 +°19 6 














When these Coefticients are computed for the 14 characters, of which 6 are indices and angles, 
available for Thomson and Maclver’s series, they are increased to 3:51 +::13 and 4°23 + -09 respectively. 

That the El-Kubanieh South series should be closer to Schmidt’s 18th to 21st 
Dynasties than the El-Kubanieh North is quite consistent with the relationships 
already found, but that it should also be closer than the 18th Dynasty series from 
Shekh Ali is unexpected. This discordance might be attributed to the fact that 
Toldt’s series was heterogeneous in respect to time, but it will be shown later that 
a more probable explanation is that both it and Schmidt’s 18th to 21st Dynasties 
series were intermixed very slightly with an alien stock. The positions shown in 
Fig. :, p- 2, of the El-Kubanieh relative to the other series are the most reasonable. 

We regret that it has only been possible to make slight use of Dr Bruno 
Oetteking’s paper (22) on ancient Egyptian crania. He has not sexed his material 
and therefore it cannot be used for comparative purposes. It is strange to find 
a craniological study as late as 1909 in which the writer is content to take means 
and variations on a population without distinction of sex. The Egyptian skulls 
measured by Dr Oetteking are in the University of Ziirich and in 1918 Dr Adolf 
Schultz published measurements of 35 of them (25 ~ and 10 ?) for the purpose of 
a special enquiry ((23), measurements on pp. 72—79). Accepting that sexing, the 
means of the 25 specimens were reduced from Schultz and Oetteking’s tables and 
they are given in Table XIV. The material was probably Theban of the 
18th Dynasty. The Coefficients of Racial Likeness are given in Table X VIII. 

The short Ziirich series is of almost identically the same type as Thomson and 
Maclver’s of the First and Second Dynasties. It takes up a position between the 
latter and Schmidt’s series of the 18th and 19th Dynasties (see Fig. 1, p. 2) and it 
gives another representation of the wide-spread population of Lower Egyptian type 
which was predominating in Upper Egypt in Late Dynastic times. There is not the 


slightest suggestion of alien admixture. 


































TABLE XVIII. 


Coefficients of Racial Likeness with Schultz and Oetteking’s Zitrich Series. 


Ziirich Series (23-9) 

















| All Characters Indices and Angles 
: Number of Number of 
| C. L. Characters C.L Characters 
‘eonaenanes = — 
| 18th and 19th Dynasties Schmidt (29°9) 0-51 + °09 29 1°25+°15 10 
Ist and 2nd Dynasties Royal Tombs (21°8 0°13+°13 14 0°25+ "19 6 
| 26th to 30th Dynasties (885°4) ... .-- | 2°02+°09 31 1844-14 12 








In 1907 Dr Hermann Stahr(21) published full individual measurements of 
115 skulls from Thebes belonging to the period from the 18th to 20th Dynasties. 
The means of the 56 skulls given in Table XIV were reduced from Stahr’s tables ; 
not a single one is given in the original paper. The series appears to be quite 
a homogeneous one and it is of almost pure Lower Egyptian type. The Coefficients 
of Racial Likeness between it and the series which it most closely resembles are 
given in Table XIX. 
TABLE XIX. 
Coefficients of Racial Likeness with Stahr’s 18th to 20th 
Dynasties Series. 








| 18th to 20th Dynasties Stahr (54:1) | 
| All Characters | Indices and Angles | 
| | piss ace Gon 
| C.I Number of | C.I | Number of 
| ii | Characters | we | Characters 
x ee a ee. tom ae 
Roman Period (49°3 = bis aa 1°42+°13 14 0°62 +°19 6 
Ist and 2nd Dynasties Royal Tombs (21°8) | 0°91+°13 14 1-444 °19 6 
| 18th Dynasty Schultz and Oetteking (23°9 ‘iets 28 Webeot Ll 








The bond between Stahr’s series and that measured by Schultz and Oetteking 
is closer than that between any other two series dealt with in this paper. That is 
not surprising as they were contemporaneous and both came from Thebes. 

For the series dealt with in the Biometric Laboratory and those measured by 
German craniometricians many more characters are available than the 21 provided 
by Thomson and Maclver, The significances of the differences of the additional 
characters that are used in the computation of the Coefficients of Racial Likeness 
can be gauged in the usual way by considering the values of a. No very detailed 
comparison need be made here: it entirely confirms the conclusions already arrived at. 
In the following list (Table XX) the types are arranged in order from the almost pure 
Upper Egyptian type represented by the Naqada crania to the pure Lower Egyptian 
type of the 26th to 30th Dynasties series. The intermediate series, all of which 
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came from Upper Egypt, represent the more or less gradual transition from one 
extreme to the other. Where a character does not differ significantly between the 
Upper and Lower Egyptian types it is always found that no one of the intervening 


types differs significantly from either of the extremes. 


Where, however, the 


characters of the Upper Egyptian type do differ significantly from those of the 
Lower then, for those characters, there is a more or less gradual transition from 


one extreme to the other when the series are arranged in the order shown, which 
is the order in which the Coefficients of Racial Likeness arrange them. For some 
characters Motley’s First Dynasty series is quite clearly differentiated from the 


others. 


The length characters S, S;, 


’ 


Y ¢ 
S., Ss, 


17 
Ds, 


G, and fml, the facial breadths GB and 


the foraminal indices, profile angles and occipital indices of the Upper and Lower 


Egyptian and all intermediate types seem to be identically the same. 


Small 


differences are found between the characters associated with facial heights (O., 


1000,/0,/ and 100G’H/GH) and 
change in calvarial breadth (B’, U, Q’ and C), and the gradual transition from the 
Upper to the Lower Egyptian type can be traced more or less clearly in the case of 
the latter. In general a comparison of these additional characters entirely confirms 


Oo 
o 


ry 


‘eater differences between those influenced by 


the relationships of the two extreme types already noted. The Lower Egyptian 
race is characterised by its much 


sag 
o 


greater 


5S 


calvarial and zygomatic breadths and 
cephalic index and its slightly greater facial heights, while all lengths in the 
ittal plane and facial breadths other than the zygomatic breadth are the same 


for the two races. Some mean measurements of the First Dynasty skulls of th 


Royal Tombs from 


type was primarily 


Series 


Naqada A and @ Series 
El-Kubanieh North 
El-Kubanieh South a “ae 
i8th to 21st Dynasties Schmidt 
18th to 20th Dynasties Stahr ... 
18th Dynasty Schultz and) 

Oetteking j 
i8th and 19th Dynasties Schmidt 
26th to 30th Dynasties 


Ist Dynasty Royal Tombs) 


Motley J 
Modern Abyssinian, Tigre dis- ) 


trict j 





Abydos are 


5 


given in 


Table XX. 


Lower Egyptian, though inclining slightly towards the Upper 


It 


TABLE XX. 


Comparison of Additional Mean Measurements. 


Constant Characters 


— x OO =] 


95°9 


100 100 
0/R| B | l Oh G'H|GB | 0/0, ; 
39°1 | 92°6 510°4 | 32°2 82-4 1388°6 
38°1 | 90°4 | 505°6 | 32°3 84°9 1355°0 
38°4 | 92°7 | 506°2 | 32°8 85°7 136671 
38-1. 93°8 510°8 . 33°0 86°5 1387°6 
38°3 shel 2G | 32°8 85°6 1451°0 
38°3) 95°1 | 512°8 | 33°0 73°1 86°2 1393°8 
38°7 | «96°1 | 517°] 3°5 74°4 86°5 1413°7 
38°4 vidi anid 3°7 74°] 87°8 1438°9 
| l 
10°0  93°0 | 513°5 | 33° 76°8 | 86°1 | 1364°3 
we | 
94°6 | 513-0 | 33°2| 74:2 
| 
| 


has been shown that that 





Characters showing significant differences 
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Egyptian type. The majority of its measurements, including all those of the 
calvaria, are intermediate between the measurements of the Upper and Lower 
Egyptian races, but others clearly differentiate the First Dynasty skulls from all 
other Egyptian types. The greater facial heights G’H and NH’, but not O,, are 
the most distinctive characters, but BZ, GB, O,'R, Gj and G, are also greater 

than for any other series and the indices 100 G’ H/GB and 100 fmb/fml diverge 
from the normal type. Motley’s series is undoubtedly not pure Egyptian, but we 
are at present unable to say what other race took part in its make up; it was 
certainly not negroid. 

In the last row of Table XX there are some mean measurements of modern 
Abyssinian skulls; the complete measurements are in Column 4 of Table XIV. 
The data are provided by Dr Sergio Sergi(26). The series with which we are 
concerned came from the extreme north of the country—the Tigre district— 
and it is distinctly different from two other shorter ones from Central Abyssinia 
and Somali respectively*. There is a close relationship between the Tigre skulls 
and Middle Dynastic Egyptian types. 

When it is remembered that an interval of well over 3000 years must have 
separated the Abyssinians from any of the dynastic skulls with which they are 
compared in Table X XI, the Coefficients appear to be surprisingly low. The bond 


TABLE XXL. 


Coefficients of Racial Likeness with Modern Abyssinians (Tigre District). 


18th to 21st * - 18th El- 9 ra 
Dynasties —— Dynasty Kubanieh = —— 
Schmidt mon) Shekh Ali! South 65-9) 
(162°4) Reo (40-0) (63°5) (60°% 
= ee : . : 
au a aa 
Moder 14 Characters ...| B48+°13 | 6024-13 | 2-104°13 | 1164-13 | 3-72+°13 
saab 6 Indices and Angles! 1°764°19 | 2°014°19 | 2°494+°19 | 3°274°19 | 3°244°19 
Abyssinian = = = = 
(Tigre | —— "i 
District } = 
61°8 | 23 Characters 3°97 +°10 ~ — 187+°10 — 
| ; x 8 Indices and Angles eIS+ 17 2°46 + °17 | = 
with the El-Kubanich South series is at least as close as that between the 
Moorfields and Whitechapel English series of 17th century Londoners+. At the 
same time we should have anticipated even lower Coefficients if the Abyssinians 
| There is not the slightest suggestion of negroid admixture in the series from Central Abyssinia and 


Somali. 
+ These series are in Biometrika, Vol. v. p. 92. The Coefficients between the Whitechapel (87-2) and 
Moorfields (24-2) English are 205+ -09 for 26 characters and 1-73-+°16 for 9 indices and ee We 
have good reason to suppose that the Coefficients between the Abyssinian and El-Kubanieh Seuth series 
should be even lower than the values shown. The greatest differences between them are for NZ and AL 
and for these characters the modern series is similar to all middle dynastic ones and it is very probable 


| 
that the aberrant values given by Toldt are due toa difference in method of measurement, If NZ and 
Ac are excluded the two series have Coeflicients of 0°95-+--10 for 21 characters and - 0:13+°19 for 





6 indices and angles. 
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were of pure ancient Egyptian type. In Table XXI the series are arranged in the 
order given by their own inter-relation but their degrees of resemblance to the 
Abyssinians do not arrange them in the same order. The Coefticients with 
Schmidt’s 18th to 21st Dynasties series are decidedly lower than we should have 
anticipated. It is probable that both that series and the El-Kubanich Scuth were 
slightly mixed with an alien stock which had a greater effect on the later 
Abvssinians. The only individual characters which distinguish the latter from 
the normal middle dynastic type of Upper Egypt are its small NH’, G’H, LB and 
GB, and the nasal index being peculiarly large and the foraminal index peculiarly 
small, And for those characters the El-Kubanich South series and Schmidt’s of 
the 18th to 21st Dynasties show a tendency to diverge from the normal in the 
direction of the Abyssinian values: the VH’ and 100 fmb/fml of the former and 
the G'H of the latter are smaller than for any other dynastic series*. 

9. Type Contours of First Dynasty Skulls from Royal Tombs at Abydos. The 
horizontal, transverse and sagittal type contours of the First Dynasty skulls from 
royal tombs at Abydos (see p, 22 above) were constructed by Mr G. H. Motley from 
the mean measurements of contours of the individual skulls in the manner commonly 
practised in the Biometric Laboratory+. The mean measurements are given in 


Tables XXII, XXIII and XXIV and the outlines in Figs. 3—8. The ¢ contours 
TABLE XXII. 
Mean Measurements of the Transverse Vertical Contours of 32 £ and 10 ¢ 


First Dynasty Skulls from Royal Tombs at Abydos. 


1 





| | | } | | 
| 
Sex MA | un IL | M4R|M UL | 2k | 21 aR IL | AR 1D, | BR 51 | 6R | 6L 
a Se See a ee en ee ee a a a ee 
| | | | | 
t L10°6 57°1 60°0 | 59°3 60°5 | 6O'L | 62°9 | 62°7 65°3 | 64°6 | 66°1 | 64°9 | 66°0 | 64°7 
. 108'2| 54°3 57° 56°5 | 58°] | 57°7 60°9 | 60°4* | 63°1 | 62°4* | 64°3 | 62°9 | 64-4 | 62-9 
| | 


| 
| | ZR.R 
TR | 71 8R | 8L | 9R | 9L | 10R | 10 | 44R 


| 1 
| | 








——|— ae ie Seer ee cee eee Pee Pe 


=. 65°1 | 63°3 | 61°6 | 59°2 | 54°0 | 50°5 | 39°4 | B6-0 


| 
| "4 | 63:0 | 60-9 59°7 | 56°41 52°0 | 48°0 | 37°5 | 34°4 | 18°9 | 17°6 | 57°4 











These lengths are the means of 9 contours. 


may be compared with the types of 100 / skulls from Lower Egypt which formed 
part of the long 26th to 30th Dynasties series (see p. 29 above) and of which the 
three type contours were constructed by Benington in 1911 (Biometrika, Vol. vit. 


* The Abyssinian and the El-Kubanieh South series are the only two with which we are dealing to 
have a greater frontal (S,) than parietal arc (S.): see Table XIV. These measurements are unfortunately 
not available for Schmidt’s series 


¢ For a full account of the method used see Biometrika, Vol. xtv. pp. 227—241. 
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173 


pp- Let). 
above the nasio-gamma base line, so there is no representation of the facial bones 


The sagittal contour of the latter series was only given for the are 
and base of the skull. On superposing the outlines, the differences between them 
are seen to be very small : 
This 


conclusion is in accordance with that derived from a comparison of direct measure- 


their lengths, so we cannot say that any of the differences are significant. 


ments as the only significant differences which could be distinguished in that way 


were those between facial measurements. 


TABLE XXIII. 


Mean Values of the Horizontal Contours of 32 £ and 10 2 First Dynasty Skulls 
From Royal Tombs at A bydos. 



























































the type zones would actually overlap for the whole of 























{ j | } | 
FO |F1R| FAL \K rl F4L | 2R | 20 |23R| 240 | | 3R | 31 | am | 40 | SR | SL | 
| Loa | | oe, 
tt | _ 
183°9 | 23°0 | 23°5* | 34°7 | 35°54 | 15°9 | 47°24 | 46-1 | 46-8* | 47 ‘9 | 48°Ot | 53°7 (| 53> | 61-0 | | 60°8* 
177% | 20°7 | 21°9 | 32°9 | 34°6 | 14°3| 45°8 | 44-1] 45-0 | 46-0| 46°6 | 51-93 hey | 58-9 | 58-8 
| | | | | 
' | 3 2 | Pa | | 
6R 61 Th iL 8R 8L 9R 9L | 10k | 1OL |ORR| ORL! T(R)r| T(L)2z | T(R)v| T(L)y 
q os | | | 
Jos ph ag ON UE UN JR a Fox Seek 
| 65°8 | 65°2* | 66°9 | 66°6* | 64°3) | 64°0 | 57-7 | 57°8 45°1 45°8 | 24-4 | 24-9] 20°5 | 19-9¢| 47°9 | 48-9+ 
| 63°6 | 63-9 | ej jee io 64°3 | 56-9 | 58-4 | oat 1G-4 | 25-5 | 25+ | 19-9 | 19-7 15-9 | 47-2 | 


These lengths are the means of 31 contours. 


Sh ++ 


TABLE XXIV. 
Mean Measurements of the Sagittal Contours of 31 ¢ and 10 2 First Dynasty 
Skulls from Royal Tombs at Abydos. 




















Ordinates above Ny 
Sex Ny = iia — —— = = =. 
| | | } | 
O=N| Nj} 1 2 Si) 4 | 5 | 6 i 8 9 | vt | yt | y=10 
os he See ae a ee ae, oe . — * aa | 
} 
. ek ee Re ce 
182°8 | 15°4 3L°9 | 56°38 | 69°3 | 767 | 80°8 | 82-4 | 83-0 80°O | 70-4 | 514 | 25S] 188) 5°27 
> 611765] 15-2 | 36-2 | 56-5 | 68-4 | 746 | 78-2 | 80-2 | 80-0 | 76°3 | 67-3 | 46-2 | 23-3] 16-2) — 1-52" 
| | 
The negative sign here indicates that the 10th ordinate meets the outline below the Ny line on 


the type. 
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TABLE XXIV—(continued). 








Ordinates below Ny Vertex B Glabella | 
Sex i — 
O=N| N} 1 2 8 9 vi yi jxfromN| y j\xfromN| y j\xfromN' y 
} | 
| “A . _|- — eee ee | 
| 6 70°7* | 67°2 | 57°4 | 54°5t | 46°1 | 35°1 | 26°3 | 19°0| 103°8 | 83°4 75°0 | 811 1°2 9°7 
? 65°9 | 61°5 | 52°6 | 49°7 48°3 | 38°4 | 30°4 | 23°3 95°7 81°] 79°3 79°0 1-0 9°1 | 
ie 
N Sub-Orb. | Aur. | Op. Inion Basion | Aly. 
| | 
Sex |—— - - . a —|— 
« from | « from Fa from | |v from x from . f ,| from | from 
y N y San ae es y é y from *y | from N | Ww “a 
| y é y | Y Y Se 
| = = = aoe wees ATR 
| 
t 5°7 | 27°7| 10°9 | 29°1 | 94°1 | 28-1 | 57°O* | 53°3* | 14°3 | 32°1| 1066 | 1015 | 74°4 | 95°7 
1:8 | 22-2) 9°8 | 28°7| 90°9 | 27°9)| 56°0 | 55:1 14:3 | 35°5 | 102°5t | 98°7t | '70°1}|93- 
| | 
—= : = a pases — 
. : ; Max. Sub. of | 
ose alate Sp. } N. 8. y x] ‘ 
Nose Palate p N.S ‘My he Chow 
Sex Pp’ P 
, re is « from a from x from 
“) (4) ) | & from v from N y N ¥ N y 
N Y Alv. y 
t 102+ | 3°Ot | 6°3* | 49°8t 51°5t | 36°1t | 19°6t | 67°0 | 34:0 64 | 54°1 3°4 62°5 
0°82 2°9 67 18°0 48°0 | 38:2 | 186 | 64°6f | 32°8 6°0 19°8 1-0 54°9 
| “ oor | | 
Frontal Max. Occipital Max. | Max. Sub. Max. Sub. 
Sub. to N8 Sub. to \ Op. to NX to G.I. 
| Sex 
: ; ; i= 
| v from N y v from \ y v from N y | 2 from G y 
; | 
| t 50°9 24°6 15°2S 26°88 84°9 68°7 104°5 100°0 
| § 50°2 26°7 12°5 26°2 81°9 69°2 10070 | 98-4 


These lengths are the means of 29 contours. 
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Fig. 3. Transverse Type Contour of 32 3 First Dynasty Skulls from Abydos. 
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Fig. 4. Transverse Type Contour of 1092 First Dynasty Skulls from Abydos. 
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Fig. 5. Horizontal Type Contour of 32 3 First Dynasty Skulls from Abydos. 
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Fig. 6. Horizontal Type Contour of 1092 First Dynasty Skulls from Abydos, 
Biometrika xv11 
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+ First Dynasty Skulls from Abydos. 


Fig. 7. Sagittal Type Contour of 32 
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Lengths Arcs 
r= ae — —————_—_____A eS 
ce at oe Bah eee =I a2 ] a l 
Ne. Oe a a ee L Bo) oes | we H’ | OH | LB Q Q’ Ss S, 
| | | | | | | | 
as a ee te are ee 4 es 
Males | | | | | 
c 2 ——— = - | | | 
I31 1250 | 596 174°5 a77°5 170°3 | 130°5 | 94°9 | 131-3 | 131-6 | I12°7 | 1020 | 302-0 20°5 
132 — | 535? | 1769 a 180°7?| 137°1 95°3 — | 118-7?) — ror'r | — 17°5 
146 1438 | 954°5 I9QI°5 | 193-2 I9QI'5 144°4 99°90 | 145°O | 143°6 | 120-2 108-1 322°0 34°5 
153 A | 1639 | 860 | 1969 | 201-0 200°4 | (34°8 94°0 | 145°8 | 146:4 | 120-3 | 110-2 322-0 28-0 
205 1276 | 600 | I8t-r | 186°8 | 185°5 | 136°4 8Q°2 | 127°8 | 127°0 109°8 | 100°6 | 295°5 26°0 
229 | 1399 | 558°5 | 184°8 | 187°5 1572 | 136°0 93°O | 125°5 | 124°O | 112-0 | ror-4 | 309-7 25°5 
254 1278 | 824 183°2 | 1840 | 183-8 | 132-6 99°I | 129°0 | 127°6 | 113-9 | 93°9 | 303°5 29°5 
: a@ | eine Done | > 7s . ° | ° | “7 . . c 
390 1478 | 589°5 | 187-9 | IQI‘O | IgO'I | 147°0 93°2 | 134°9 | 133° | 113°7 | 103-4 31370 21-0 
| 
413 1297 | 673 176°0?| 179°1 | 178°9?| 132-6 O4°I | 142-0 | 144°1?) 1171 | 1002 | 310°0 369°0 | 124-0 
426 1410 | 699 180-2 | 181°5 | 181-0 | 140-4 88-5 | 1340 | 133°5 | 113°3 | 100-0 | 308-5 126-0 
428 | ~ 184°3? 185°3?) 139°0 94°8 I2I4);, — | — | — — 
429 1323 | 616 181-2 184°3 | 1360 88-5 | 133°0 | 132-2 | 112-2 | 104°0 | 304°0 | 3 131-0 
, ' | 
32 1441 591 184:2 186-0 185°9 I40°7 | 93:0 137°0 1302 1150 100-0 | 309°5 309°5 381-0 131-0 
433 1350 | 762 184:2 | 187:2 | 186°5 | 139:0 97°O | 133°7 | 133°8 | 115°O | 105°8 | 313°5 | 315°5 372°0 | 130°0 
430 608 I82-r | 185 185°0 | 135°9 95°O | 137°O | 135°I | 1127 | ror-4 | 301°5 | 303° | 373°0 | 123°0 
= 8 32:0 | 32-09 ° 7 2 267 9 *Q | = ~ a 
442 1306 | 676 180-1?) 183-0 IS2°0?| 132°1 94°5 | 137°O | 130°72) II4°9 | 102-1 | 304-2 305°O | 370°0?) 124-0? 
} | 
| | | | j 
440 1438 | 651°5 | 184-1 | 185-0 | 185:0 | 138-6 88-4 | 138-0 | 137°5 | 120:9 | gg-0 316°3 | 320-0 | 381-0 | 131-0 
| | | | _ . 
| | 
147 1203 | 800 184°3 | 190°9 | 132-0 QO°O | 134°O | I3I*I | III-2 | 108-8 | 295-7 | 296-0 306-0 | 121-0 
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48 CRANIA OF THE FIRST DYNASTY (Lene 


REMARKS (See Section 2 of 


Teeth worn; P.B. downwards with sharp edge; SR; J equal; ossicle of \ and we 
All teeth except 6 lost; P.B. upwards with sharp, double edge; S equal; J equa 
Sagittal suture closing at obelion; 5 teeth lost; P.B. downwards with blunt edgq 
Aging; all sutures closing; teeth very worn and 8 lost; P.B. flat with blunt edge 
Aging; coronal and sagittal sutures closed; teeth sound but worn; P.B. flat with 
Aging; sutures closing; teeth worn; P.B. downwards with sharp edge; S equal; 
Aging; all sutures closing but teeth perfect; lower 3rd molar L. lost; P.B. flat w 
Aging; sutures closing; teeth worn but complete; P.B. flat with sharp edge; S 
inionic protuberance. 
Deep palate; P.B. downwards with rounded edge; S equal; J equal. 
Teeth sound but very worn; P.B. flat with sharp edge; SL; JR; flattened obelic 
Calvaria and mandible only; tympanic perforation L.; 8 equal; tripartite interp: 
Sagittal suture closing at obelion; teeth worn; P.B. downwards with sharp edg| 
epipteric bone R. 
Aging; teeth very worn; coronal suture closing; 6 teeth lost from upper jaw; P. 
Teeth sound but worn; P.B. downwards with sharp edge; S equal; JR; small ey 
No lower 3rd molars; P.B. upwards with sharp edge; 8 equal; JR. 
Teeth complete but worn; P.B. downwards with rounded edge; SR; JR; complex 
spine 10 mm. long below and to the left of the inion. 
Adult but no upper 3rd molars; teeth worn; tympanic perforation L.; P.B. upwag 
near asterions. 
Teeth worn and 4 lost; no lower 3rd molars; slight tympanic perforation R.; P. 
Upper 3rd molars lost and no 3rd molars in lower jaw; P.B. flat with blunt edgi 
Aging; sagittal suture closing; teeth worn but complete; P.B. upwards with sha 
Teeth worn; P.B. flat with blunt edge; J equal. 
Aging; sagittal suture closed; teeth worn; P.B. upwards with blunt edge; S equ 
jecting on left. Only ramus R. of mandible. 
Aging; sutures closing; teeth worn but complete; slight tympanic perforation L. ; 
Teeth worn; no 3rd molars in lower jaw; P.B. flat with blunt and double edge; 
Outer palate bridge L. and inner palate bridge R.; P.B. downwards with blunt 
Damaged calvaria and detached facial bones; teeth worn. 
Aging; calvaria only; sutures closing and vault falling in at obelion; SL; JR; we 
Aged; facial bones and ex-occipital missing; teeth of mandible sound out worn; 8 e¢ 
P.B. upwards with sharp edge; SR; J equal; no mandible. 
Teeth perfect; P.B. fiat with blunt edge; S equal; JL. 
Aging; sagittal suture closed; teeth very worn and 2 lost; P.B. upwards with s 
with frontal. 
Teeth worn; P.B. upwards with sharp edge; 8 equal; JR; wormians lambdoid I 
Teeth perfect; P.B. upwards with sharp edge; SR; JR. 
Aging; coronal suture closing; no 3rd molars in either jaw; left facial and temp 
Teeth worn; slight tympanic perforation R.; P.B. upwards with blunt edge; 8 
5 mm. high) on left supra-mastoid crest. 
Tympanic perforation R. and L.; P.B. wpwards with sharp edge; S equal; JR. 
Sagittal suture closing; teeth worn but all sound; P.B. flat with sharp edge; SR 


Sagittal suture closing at obelion; ceeth very worn and many lost: P.B. upware 
lambdoid R. and L.; post-coronal depression and very depressed roots of nasi 


Teeth worn; P.B. upwards with blunt edge: S equal; JL; fiattened obelion. 
Several teeth lost; sutures beginning to close; P.B..flat with blunt edge; 8 equal 
frontal R. and L.; wormians lambdoid R. and L. 
Teeth worn and 1 lost; P.B. flat with rounded edge; 8 equal; JR. 
Sagittal suture closing; teeth very worn; P.B. upwards with sharp edge; S equal 
P.B. flat with rounded edge; 8 equal; J equal; epipteric bone L.; single ossicle in sé 
Teeth worn and no lower 3rd molar R.; P.B. upwards with sharp edge; S equal; 
P.B. upwards with sharp edge; S equal; large epipteric bone R.; wormians lamb 
Aging; sagittal suture nearly closed; teeth worn; small upper 3rd molar K.; ve 
sharp edge; S equal; JR. 
Young adult; 3rd molars not fully erupted but teeth worn; P.B. flat with blunt 
Teeth worn; S equal; JL; P.B. flat with sharp edge; small wormians lambdoid 
occipital bone. 





MEASUREMENTS ON 48 CRANIA OF THE FIRST DYNASTY (Lenetus in Mir 


Angles 


1 . REMARKS (See Section 2 of Text) 


75°-2 | 43°3 | 29%5 | 13°8 | 89°-0 | Teeth worn; P.B. downwards with sharp edge; SR; J equal; ossicle of \ and wormians lambd 
All teeth except 6 lost; P.B. upwards with sharp, double edge; S equal; J equal. 





75°-0 | 43°°8 33°-0 | 1o°8 | 85°-8 | Sagittal suture closing at obelion; 5 teeth lost; P.B. downwards with blunt edge; SR; JR; pre 
76°-0 | 37°0 | 25°-6%| r1°-4%| 87°-4?| Aging; all sutures closing; teeth very worn and 8 lost; P.B. flat with blunt edge; S equal; JL; 
65°-0 | 490 30°-5 | 18°7 | 83°7 | Aging; coronal and sagittal sutures closed; teeth sound but worn; P.B. flat with sharp edge; 
73°°8 | 41°1 | 25°-0 | 16°1 | 89°-9 ing; sutures closing; teeth worn; P.B. downwards with sharp edge; S equal; JL; epipteric | 
O8°-9 | 45°5 | 27°0 | 18°5 | 87°-4 | Aging; all sutures closing but teeth perfect; lower 3rd molar L. lost; P.B. flat with sharp edge 
76°-9 | 45°°5 34°°3 | 11°2 | 88°r | Aging; sutures closing; teeth worn but complete; P.B. flat with sharp edge; S equal; JR; asy 


inionic protuberance. 
2 Deep palate; P.B. downwards with rounded edge; S equal; J equal. 
3°*3?) 15°47?) 87°-9?| Teeth sound but very worn; P.B. flat with sharp edge; SL; JR; flattened obelion. 
Calvaria and mandible only; tympanic perforation L.; 8 equal; tripartite interparietal with os 
72°-4 | 47°-0 | 30°52) 16°-57| 88°-g?| Sagittal suture closing at obelion; teeth worn; P.B. downwards with sharp edge; SR; JR; si 
epipteric bone R. 


73°°4 | 46°5 | 31°96 | 14°-9 | 88°-3 | Aging: teeth very worn; coronal suture closing; 6 teeth lost from upper jaw; P.B. flat with sh 

70°-o | 44°2 | 27°4 | 16°8 | 86°-8 | Teeth sound but worn; P.B. downwards with sharp edge; 8S equal; JR; small epipteric bone I 

72°-0 | 43° | 33°1 9°-9 | 81°-9 | No lower 3rd molars; P.B. upwards with sharp edge; S equal; JR. 

69°-2 | 46°5 | 32% | 14°-4 | 83°-6 | Teeth complete but worn; P.B. downwards with rounded edge; SR; JR; complex lambdoid sut 
spine 10 mm. long below and to the left of the inion. 

71°! 42°°7 30°'r | 12°-6 | 83°-7 | Adult but no upper 3rd molars; teeth worn; tympanic perforation L.; P.B. upwards with blunt 


near asterions. 

Teeth worn and 4 lost; no lower 3rd molars; slight tympanic perforation R.; P.B. downwards 

Upper 3rd molars lost and no 3rd molars in lower jaw; P.B. flat with blunt edge; 8 equal; JF 

Aging; sagittal suture closing; teeth worn but complete; P.B. upwards with sharp edge; S eq 

83°-9 | Teeth worn; P.B. flat with blunt edge; J equal. 

5°*4 | 50°9 | 31°42) 19°52?) 84°-o%) Aging; sagittal suture closed; teeth worn; P.B. upwards with blunt edge; S$ equal; JL; post-« 
jecting on left. Only ramus R. of mandible. 

-o | 42°4 | 28°42) 14°-0%| 80°07) Aging; sutures closing; teeth worn but complete; slight tympanic perforation L.; P.B. flat with 

67°°3 | 43°°6 | 33°0 | 107-6 | 77°-o | Teeth worn; no 3rd molars in lower jaw; P.B. flat with blunt and double edge; SR; JR; post 








74°°5 | 46°°3 1-8 | 1495 | 89°-0 | Outer palate bridge L. and inner palate bridge R.; P.B. downwards with blunt edge; SL; sin, 
Damaged calvaria and detached facial bones; teeth worn. 
Aging; calvaria only: sutures closing and vault falling in at obelion; SL; JR; wormians lamb 
Aged; facial bones and ex-occipital missing; teeth of mandible sound but worn; 8 equal; sutures 

71°9 | 43°°7 31° | 12° | 84°-0 | P.B. upwards with sharp edge; SR; J equal; no mandible. 

70°2 48°°0 26°°] 11°-9 | 82°1 | Teeth perfect; P.B. flat with blunt edge; S equal; JL. 

69°°2 17°°9 1 a | 2°-8 82°-0 | Aging; sagittal suture closed; teeth very worn and 2 lost; P.B. upwards with sharp edge; S| 

with frontal. 
70-5 | 407 | 262 | 14°%5 | 85°0 | Teeth worn; P.B. upwards with sharp edge; S equal; JR; wormians lambdoid R. and L. 
3-0 | 4r°-o | 25°0 | 16°-0 | 89°-0 | Teeth perfect; P.B. upwards with sharp edge; SR; JR. 
$o°-0 | 41°-0 | 28°-3%| 12°-7%| 92°-7?| Aging; coronal suture closing; no 3rd molars in either jaw: left facial and temporal bones mi: 


73°°8 42°°8 31°55) -11°%3 | 85°%r | Teeth worn; slight tympanic perforation R.; P.B. upwards with blunt edge; 5 equal; JL; b 
5 mm. high) on left supra-mastoid crest. 

69°3 | 46°5 | 30%9 | 15°°6 | 84°-9 | Tympanic perforation R. and L.; P.B. upwards with sharp edge; § equal; JR. 

77°4 | 38° | 24°%7 | 13°9 | g1°%3 | Sagittal suture closing; teeth worn but all sound; P.B. flat with sharp edge; SR; JL. 


68°-6 | 42°8 | 29%5 | 13°3 | 81°9 | Sagittal suture closing at obelion; teeth very worn and many lost: P.B. upwards with very 
lambdoid R. and L.; post-coronal depression and very depressed roots of nasal bones. 





69°2 | 45°°8 +4 | 164 | 85°-6 | Teeth worn; P.B. upwards with blunt edge; S equal; JL; flattened obelion. 
09 | 42°°3 16°-5 | 87°4 | Several teeth lost; sutures beginning to close; P.B..flat with blunt edge; 8 equal; J equal; la 
frontal R. and L.; wormians lambdoid R. and L. 
76°-0 | 38°-9 35 3°62 9°-2 | Teeth worn and 1 lost; P.B. flat with rounded edge; S equal; JR. 
75°°5 | 41 29°3, | -12°5 | 88°o | Sagittal suture closing; teeth very worn; P.B. upwards with sharp edge; S equal; J equal; w 


09 | 45°9 | 30%4 «15°5 | 86%4 . PLB. flat with rounded edge; S equal; J equal; epipteric bone L.; single ossicle in sagittal sutur 








5 0 | 41°5 | 33 °°6 7°*9 | 82°9 eth worn and no lower 3rd molar R.; P.B. upwards with sharp edge; S equal; JR; epipte 
72 jO°"4 30"°1 16°°3 | 89° P.B. upwards with sharp edge; S equal; large epipteric bone R.; wormians lambdoid R. and 
4°1 | 39%1 | 28°-2 | 10°9 | 85°-0 | Aging; sagittal suture nearly closed; teeth worn; small upper 3rd molar R.; very slight ty 


sharp edge; S equal; JR. 
81°-5 | Young adult; 3rd molars not fully erupted but teeth worn; P.B. flat with blunt edge; SR; s 
-r | 16%4 | 88°7 | Teeth worn; S equal; JL; P.B. flat with sharp edge; small wormians lambdoid R. and L.; 
occipital bone. 





YTHS IN MILLIMETRES). 


wormians lambdoid L. No mandible. 

al. 

; projecting spine below temporal ridge L. 

ige; S equal; JL; very projecting angles of mandible. 
‘ith sharp edge; SR; JR. 

1; JL; epipteric bone L. 

with sharp edge. 
S equal; JR; a ul occiput; massive mastoids and 


lion. 
srparietal with ossa triangularia only separate. 
»dge; SR; JR; single ossicles in coronal suture R. and L.; 


P.B. flat with sharp edge; SR; JL; ossicle of 2: 
| epipteric bone L. 


lex lambdoid suture with small wormians; blunt projecting 
wards with blunt edge; SR; JL; ossicles in lambdoid suture 
P.B. downwards with sharp edge; S equal; JR. 

-dge; S equal; JR. 

sharp edge; S equal; 2 ossicles lambdoid R. 

equal; JL; post-coronal depression; facial asymmetry, pro- 
L.; P.B. flat with sharp edge; S equal; J equal; no mandible. 

ge; SR; JR; post-coronal depression. 


int edge; SL; single wormian lambdoid L. 


; wormians lambdoid R. and L. No mandible. 
S equal; sutures almost completely closed and vault falling in. 


th sharp edge; SR; JR; process of temporal R. articulating 
id R. and L. 


‘mporal bones missing; P.B. flat with sharp edge; SR; JR. 
; 8 equal; JL; blunt projecting spine (10 mm. x to mm. by 


SR; JL. 


wards with very rounded edge; S equal; J equal; wormians 
f nasal bones. 


‘qual; J equal; large processes of temporals articulating with 


qual; J equal; wormians lambdoid R. and L. 

» in sagittal suture above \ and wormians lambdoid R. and L. 
qual; JR; epipteric bone R. 

lambdoid R. and L.; left side of mandible only. 

.; very slight tympanic perforation L.; P.B. upwards with 
unt edge; SR; simple interparietal. 

idoid R. and L.; traces R. and L. of horizontal suture across 








KARL PEARSON 53 


Note.—The Definition of the Alveolar Point (see p. 20 above). By Kart PEARSON. 


There has been much vagueness about the localisation of the so-called alveolar 
point. It culminates probably in Flower’s definition*: “ Alveolar Point, the centre 
of the anterior margin of the upper alveolar arch.” This might appear capable of 
interpretation had not Flower defined what we term the profile length and then 
described the Basialveolar Length as “ Basion to alveolar point, or the most distant 
part of the anterior margin of the alveolar arch.” 


Now the most distant part of the anterior margin is rarely in the mesial plane 


of the skull, but on the anterior face of one or other of the front incisors on either 
side of the mesial plane. This difficulty has been recognised by certain authorities, 
and they define the alveolar point to be the point in which a tangent to the border 
of the alveolar arch on these incisors meets the mesial plane. This is a fairly 
definite point, but as if usually “hangs in the air” it is not capable of accurate 
determination in the case of the average skull; it is not at all a suitable definition 
for the Knotenpunkt of a whole series of cranial measurements. We next turn to the 
classical French craniometricians. Paul Broca in his Instructions crantologiques et 


+, writes as follows: 


crantometriques, p. 42, Paris 1875 

La région incisive, comprise entre le nez et la bouche, est formée par la partie de Varcade 
alvéolaire qui supporte les dents incisives. On y remarque, sur la ligne médiane, la suture incisive, 
dont Vextrémité supérieure aboutit a ’épine nasale et dont Vextrémité inférieure, correspondant 
au milieu de Parcade alvéolaire, se nomme le potnt alvéolaire. 

The alveolar point is here defined to be the “lower extremity ” of the incisive 
suture, and this is said to correspond to the middle of the “areade alvéolaire.” But 
where does the incisive suture terminate? Some might hold that it runs as far as 
the incisive foramen, but no one has ventured to place the alveolar point there. At 
any rate if it be an extremity of the suture we are seeking, it is more likely to be 
on the posterior than the anterior face of the alveolar process, 

Topinard defines the Point alvéolaire as the “ point le plus antérieur et le plus 
déclive du bord alvéolaire supérieur.” 

Unfortunately in the great bulk of crania the two characteristics by which 
Topinard defines the alveolar point are directly contradictory, the most anterior 
point is not the lowest point. 

furning to English writers we have already referred to Flower. Cunningham§ 
identifies the Alveolar Point with the Prosthion and defines it as “the centre of 
the anterior margin of the upper alveolar margin.” This leaves us in a state of 
perplexity as to how a margin can have a margin, much less two margins, as 
presumably if the upper alveolar margin has an anterior margin it has also a 
posterior margin. Should the definition be translated “the point where the anterior 
margin of the upper alveolar process cuts the mesial plane,’ we must point out 


that the alveolar margin between any pair of adjacent teeth is extraordinarily vague 


Cataloque of the R.C. of S. Museum, Homo Sapiens, Part a pp. xiii and xviii, 1879. 
+ See also Mémoires de la Société @ Anthropologie, T. 1m. 2° série, pp- 96 et seq. 
Lléments @ Anthropologie Générale, p. 251, 1885. 


Text-book of Anatomy, p. xvi, Appendix D, 1909. 
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and often lacks determination, and, according to the personal equation of the 
observer, may lie anywhere from the geodesic linking up the mid anterior points of 
the alveolar margin on the incisors to the lowest point of the central process between 
the incisors. Quain’s Anatomy is more concise ; it reads* : 

“ Below the nasial aperture is the intermazillary suture; the lower point of this, 
on the alveolar margin between the central incisors, is named the alveolar point 
(prosthion).” 

This definition is more or less a return to Broca’s where we have reference to the 
incisive suture and its lower point (if not its “ extremity”); the lower point is said 
to be on the alveolar margin, but then this margin does not terminate the suture 
and the position of the margin between the central incisors is the matter under 
discussion. 

Turning to the Germans we naturally consult first the Frankfurter Verstdndi- 
gung (Archiv fiir Anthropologie, Bd. xv. 1884), but the only clue to the alveolar 
point is to be found in the definition of the upper face height (Obergesichtshihe). 
This we take from nasion to alveolar point. The Versténdigung defines it as “ von 
der Mitte der Sutura naso-frontalis, bis zur Mitte des Alveolarrandes des Oberkiefers 
zwischen den mittleren Schneideziihnen ” (S. 3). This certainly seems identical with 
the lowest point of the central process between the middle incisors. The diagrams 
attached to the Verstdéndigung throw no more light on the accurate localisation of 
the alveolar point than those in Broca, Topinard, Cunningham or Quain. 

E. Schmidt in his Anthropologische Methoden, 1885, gives 8. 210 the following 
definition : “Alveolarpunkt (Point alvéolaire), Kreuzungspunkt des Zahnrandes des 
Oberkiefers mit der Medianebene.” Here again we ask what is the “ Zahnrand,” 
the tooth-border, at the mesial plane? R. Martin in his Lehrbuch der Anthropologie 
(S. 514, 1914) sees something of the difficulty and separates the Prosthion from the 
point to which the upper face height is to be taken. Thus: “ Prosthion [pr]= 
derjenige Punkt am Alveolarrand des Oberkiefers, der in der Mediansagittal- 
Ebene zwischen den mittleren Schneideziihnen am meisten nach vorn vorragt. Das 
Prosthion liegt also nicht an dem unteren Ende des zwischen die Schneideziihne 
vorgeschobenen Knochenteils, sondern an der am meisten nach vorn vorstehenden 
Stelle seiner Vorderwand. Nur fiir die Messung der Obergesichtshéhe ist der 
Messpunkt an die Spitze des genannten Fortsatzes zu legen.” 

Thus Martin agrees with our interpretation of the Frankfurter Verstiéndigung. 
But he proceeds at once to contradict himself for he defines (S. 553) the Oberge- 
sichtshéhe (or, the naso-alveolar height for he cites the English term) as the 
“Geradlinige Entfernung des Nasion vom Prosthion,” thus introducing confusion 
into the Verstdndigung, as well as into his own book. But is it always the case that 
the most forward projecting point of the alveolar border lies between the middle 
incisors? We think not. Again, is that border generally distinguishable when it 
meets the mesial plane? Martin appears to identify it with the most projecting 


point of the anterior wall of the central process between the middle incisors; but 


Elements of Anatomy, Vol. tv. Part I, p. 113, 1915. 


t We have heard a stringent critic assert that there cannot be two central or two mid incisors, which 
is perfectly correct, but the term is convenient, and what is good enough for Quain is permissible for us. 
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it is the exact boundary between the alveolar arch and the central process between 
the mid incisors which is so difficult of determination. 

But this is not the greatest difficulty. If there be two points used, the prosthion 
and the alveolar point—defined, as the Biometricians have defined it since 1895, as 
the lowest point of the central process between the middle incisor teeth—then the 
computation of the angles and sides of the fundamental triangle of the skull 
becomes impossible. Martin would measure the upper face height (according to the 
Frankfurter Verstdndigung) to the alveolar point, but the profile length to the 
prosthion*, and he would take his profile angle from the nasion-prosthion line+. It 
is clear that there could be no orientation of the fundamental triangle (Gesichts- 
dreieck) unless all the measurements were taken to the same point, and accordingly 
somewhere between his pages 514 and 562, Martin has given up the alveolar point 
as the extremity of his upper face height. The Verstdndigung itself is very obscure 
on the measurement of the Upper Face Height and of the Profile Length. We 
have already given its definition of the former. The latter length is thus defined 
(S. 4): “von dem am meisten vorspringenden Punkt der Mitte des fusseren 
Alveolarrandes ¢ des Oberkiefers bis zum vorderen Rand des Foramen magnum (in 
der Medianebene) gemessen.” It is by no means clear that the alveolar extremities 
of the upper face height and the profile length are to be taken to the same poini ; 
it looks as if the former was to be taken to the alveolar point and the latter to a 
point which may or may not be the prosthion. 

It was experience of the indefiniteness and positive confusion in this matter 
and the need for selecting a point from which all measurements may be taken, and 
from which the orientation of the fundamental triangle (Gesichtsdreieck) with regard 
to the Frankfurt horizontal plane may be determined, which led to our selection of 
the mid-point of the central process between the middle incisors as the alveolar 
point. As a matter of fact we have preferred to define it not as the mid-point, but 
as the lowest point of the central process when the skull is orientated to the 
Frankfurt plane. As far as the fundamental triangle is concerned there is no 
more vali] reason for selecting the anterior than the posterior face of the alveolar 
arch. I regret only that we did not take palate lengths from this alveolar point 
rather than from a point on the posterior alveolar margin as vague as the prosthion 
on the anterior alveolar margin. 


Loc. cit. 8. 550. t Loc, cit. S. 560. 
{ There seems here to be some confusion of the alveolar border and alveolar arch ; for if Alveolarrand 
signifies the alveolar border or lower margin of the alveolar arch, this is presumably a curve which can 
have only one middle point, i.e. intersects the mesial plane in a single point. What is therefore the 
sense of introducing the words ‘‘ the most projecting point ” ? We can’t choose a point within a point. It 
would appear therefore either that border is used for arch, and the point in question may lie anywhere 
on the incisive suture where we reach the greatest distance from the basion, or that the Alveolarrand is 
an area in the neighbourhood of the alveolar border on which area we are to choose the point most 
remote from the basion. How this area is separated from the anterior face of the alveolar arch we do 
not know, nor have we found it anywhere stated. Another alternative, namely that the mid-point of the 
alveolar margin is invariably the point furthest removed from the basion, would not only render the 
definition tautological, but it involves a dogma,so obviously erroneous that it need not be discussed. 
§ The importance of this will be readily realised by the reader who has examined the central process 
in numerous crania ; central processes occur which are wedge-shaped V rather than semi-circular U . 
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I realise that objection may be taken to measuring the profile angle as we have 
done, probably on the ground that the nasion-prosthion line is nearer to the profile 
slope in the living than is the nasion-alveolar point line. But both are so far off 
the living profile—owing to the fiesh thickness of the lip—that the small horizontal 
projected distance from prosthion to alveolar point makes little difference compared 
with the great divergence between the cranial and living profiles. This argument, 
indeed, might justify the use of Cuvier’s facial line to the tip of the incisors* as 
better in its turn than using the nasion-prosthion line. 

If further justification be needed for my choice in 1895 of the alveolar point as 
on the central process of the middle incisors I think it may be found in the 
Entente Internationale ot 1906, resulting from the Monaco Congress*, for the unifi- 
cation of craniometric measures. Internationally this concordat should replace 
the Frankfurter Verstindigung. Therein we read under the definition of the 
Diamétre naso-alvéolaire, our upper face height, that the lower extremity of this 
measurement is “le point le plus inférieur du bordt alvéolaire entre les deux 
incisives médianes et supérieures.” 

Again for the Diamétre alvéolo-basilatre, our profile length, the anterior ex- 
tremity is the “point alvéolaire, ou point médian du bord antérieur de l’arcade 
alvéolaire.” Although the Frankfurt Concordat had been on trial for more than 
twenty years the distinguished craniometricians of the Committee of the Monaco 
Congress do not appear to have agreed on a common alveolar point for all their 
measurements or, if they did, they describe it in one of their definitions as the 
lowest point, and in the other as simply the mid-point on the alveolar margin, which 
itself has a very dubious existence in the mesial plane! 

It is very clear that one and all these definitions contain fossilised fragments of 
earlier conceptions; they can hardly be the product of craniometricians, who have 
had to struggle with the actual difficulty of determining the prosthion or alveolar 
point on long series of crania, nor of those who realise the importance of elimi- 
nating personal equation. 

We might define the prosthion as the point in which the geodesic§ on the 
anterior face of the alveolar arch between the mid-points of the anterior faces of the 
middle incisors at the anterior alveolar border meets the incisive suture; and 
again the alveolar point as the point in which the incisive suture or the mesial 
plane meets the lowest element (relative to the Frankfurt horizontal) of the central 
process between the middle incisors. In other words it is the lowest point of the 
section of the above central process by the mesial plane. If the section be straight 
edged \—’, then we must select its middle point as alveolar point. 

We believe that our alveolar point admits of far readier determination than the 
prosthion, and when it cannot be ascertained owing to absorption of the alveolar 
borders, then also no prosthion is available. 

* Or better still a tangent from the nasion to the alveolar arch in the mesial plane 


t L’ Anthropologie, Tom. xv. pp. 563—7. 


t Does ‘‘bord” here signify ‘“‘arcade”? For if not there is again tautology if the ‘‘ bord” is a curve 
meeting the mesial plane. 


§ In practice the geodesic i 


determined by drawing with a fine pencil the shortest line (curved not 
straight) between the mid-points of the anterior alveolar border on the middle incisors. 




























ON THE MOMENTS OF THE HYPERGEOMETRICAL SERIES. 


, 

| By Proressor V. ROMANOVSKY. 
| (University of Taskend.) 

' 

IN a paper of the same title Prof. K. Pearson (Biometrika, Vol. XV1. pp. 157—162) 
has given a finite difference relation, which enables us to write down the successive 
moments of a hypergeometrical series about its mean. This relation is very elegant 
and leaves nothing to desire if we evaluate the moments one after another. But it 
does not enable us to write down any desired moment without knowing the pre- 
ceding ones. The aim of this article is to provide a formula, which permits us to 
obtain the moments of a hypergeometrical series independently. 

Let a=—r, B=-- qu, y=prn—r+l, and 

: : Fi , gn r(r—1) gn (qn —1) ; 
F(a, B,y, c)=A 1+ d @+ ( 4 Y - a ee 
; Lpn—r+l 1.2 (pn—rt+1)(pan—7r+2) d 
where 
| _ pn (pn — 1)... (pm —r+1) 
n(n—1)...(n—7r4+1) 

Here the coefficient of « is the probability of drawing ¢ individuals without 
some mark in a sample of size 7 drawn from a population of x containing pn marked 
and gn non-marked individuals, 

Let #(t) stand for the mathematical expectation of ¢*. Then 

‘ E(t)= rq, 
| and it is easy to see by successive differentiations of the function F, that 
i : 
{ . asF 
BUt(t=1)...@-s4))=| | ROSWaR natn seee RMN (1), 
L das 2-1 
. ask ; 
or PRES pe) | ae eee reer eves (1 bis), 
da* |, 1 
if we put 
Hle(é —1) ... €— st P= Oy ......00cmsepeecsveseet (2). 
The function #’(a, 8, y, ) satisfies the well-known ditferential equation 
(id er ; 
a(1—«) > + (y— 0x) —aSF = 0, ° 
é da” dx 
where o6=at B +1l=l-r- qn. 
* if F(-1r,-qn, pn—r+l1, x) be written aj+a,x4+ dgu? +... +a,x'+..,+a,2", we have _ (a,) =1 


. » =? . oe . > ° ° 
and £ (f(t)) Sig (4ef (t)), or E (f (t)) is the mean value of f(t)(t=0 to r) weighted with the proba- 


bility of drawing t unmarked individuals in the sample of 7. 
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Differentiating this equation s times we have 
[+2 RF ~ . aetif “BF hae 
— +s(1 — 22) —— ry Bey t + (y— 82) = 

2 das} das da: 


e+ 
La® s+1 , 


a(1— “)- 
( 


ask 8 he a 


| — sd a “a. * 0. 
Put here «= 1 and taking regard to (1) and (2), we have 
(y —8—s) U,., =[08 + s6+s(s — 1)] U,, 
or (n —8) Uy, = (17 —8) (qn —8) Ug ...cccccscccccsccscesees (3). 
Now U, =rq and thus, with the aid of the above relation, we find 


es. r(r—1)...(7—s+ L) gn (qn —1l1)... (qn - s+1) 
pe n(n— 1)... (n—s+4+1) 
The sth moment ps of our hypergeometrical series is defined by the relation 
w= K(t— rq). 
This may be written 
ft, = Dy [ K,(t)] 


+ ee AK, (0 A’k, (0 
=E | K, (0) +t a 4 4(t—-1) = 4 
AtK, (0) | 
+t(t-—1)...(t-—s41) ni ) : 
8 i 
if we put 
(t— rq) = K, (t), 
and apply the interpolation formula of Newton. 
Therefrom 
: AK,(0) ,, . A*K,(0) .| Atk, (0) ,, 
pes = K, (0) + = | i+ 2s ae ee wah Y Rt nt (5). 
“a! 8: 
Here 
ANK, (0) = [A“K, (t)]r-0 = A*0® — Cp rgA"08— + C2 (rq A’0*? — ... —...(6), 
as it is easy to see, 
Let us put 
AOA a) 
ht = hy ae Ey ES Se Re eer aNd He (7). 
Applying the relation 
AM = h* — C)) (h—1) + C2 (h- 2p — ... #(— LPOG. D.... (8), 
we easily deduce that 
) hah. Wyn +(h—1),_, Tree rer reece rere eee eee (9), t 


for all positive integer values of h andd. It must be remarked that 
Agee} and Aya @ FOr BREAK, ocicccccssccacsaceveccds (10). 
With the aid of these relations we easily find the values of h, for different 
values of h and x. We place here a short table of these values. 
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Table of values of h, for h, X= 1, 2, ..., 10. 





’ r 
h ai 
1 2 3 J 5 ( 8 ) 10 
1 l l l l 1 l l 1 l 1 
2 — l 3 f 15 31 63 127 255 511 | 
—_— _ l 6 25 90 301 966 | 3025 9330 
j — l 10 65 350 1701 | 7770 | 34105 
5 — = — l 15 | 140 | 1050 | 6951 , 42525 
6 —— — l 21 266 | 2646 22827 
7 —-,-—- | = : —|— | 28 | 462 | 5880 
8 — _— — l 36 750 
9 — — . — — — | l 45 | 
10 —_ — ns nti oe 1 
Now put 
A’K,(0) 
it MEG 4 sineeineemedenel (11). 
v: 


Then, according to (6) and (7), 
Dnjs=hs— Cohsarg + Cohs_s(rq? — ... +(— 8 Cfo (ray, 
or, in consequence of (10), 
= — Che rg + Cehs_2(rqy = .(— 1p Ce (rq yo .. ATS): 
The polynomials Z,,, for s £10 can be written down by the aid of the table. 
For greater values of s we must recur to (8) or (9). Putting, for example, s = 5, we 
shall have 
Ly, s=—(7rqy, 
LL, 5= 1— 5rq + 10 (rq? — 10 (rq) +5 (rq), 
L,,5= 15 — 35rq + 30 (rq? — 10 (rq), 
[, = 25— 30rq + 10(7q)’, 
L,;=10—- 5rq, 
=. 
We see from (5) and (11) that 
pg Dey ot Bg Oy Egg Ua > .02 + Digg De sencsvserensees (13), 
which is the relation I proposed to establish. 
For example, taking s=5, we find 
ps= —(rgqyr+ [1 — 5rqt+ 10 (rq)? — 10 (rq) + 5 (rq)*] ae 
r(r—1)qn(qu—-1) 
n(n—1) 
r (vr — 1) (rv — 2) qn (qn—1) (qn—- 2) 
n(n — 1 y(n — 2) 


4 | = 35rq + 30 (rq y— 10 (rq)'| 


4 [25 — 30rq + 10 (vq | 
r(v—1) (v7 — 2)(r — 3) qn (qn — 1) (qn — 2)(qn—3) 
n(n —1)(n — 2)(n—3) 
r(r—1)(r7 —2) (7-3) (7 — 4) qn (qn - 1) (qn — 2) (qn — 3) (qn — 4) 
’ n(n—1)(n —2) (n—38)(n — 4) F 


+[10— 5rq] 
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Putting in (13) n=, we shall find the sth moment for a binomial (p+ q)’ 
about its mean 

Ms=L,.t+hysrqttesr(r-YGt+...t¢L,.r(v—-1)...(r-—s +1) q*...14), 


where the coefficients Z,,, keep their values. This relation may be transformed 
into a more condensed form which is indicated in my note on the moments of a 
binomial (p + q)" (see Biometrika, Vol. xv. formula (5)). 


It is not difficult to establish for hypergeometrical moments the relations 


; 1 [te 2 
lim (plos/1s*) = ae“ da, 
Var / -« 
: a. oe. 
lim (pos pet?) = ast e—2" da, 
Nar J —« 
for n— 2 and r—~+x. Therefore, according to the theorem of Tchebycheff- 


Markoff, we conclude that, for a hypergeometrical series, the probability of 
inequalities 


_t—7 
t, = Es te, 
V fe 
: l ff 
for any given ¢, and ¢t,, tends to — e-@ da forn— © and ro, 1.e. for n and r 
Var ty 


sufficiently great the hypergeometrical distribution is near to the normal curve, 
as was long ago indicated by Lapiace. (Mémoires de mathématique...présentés a 
lV’ Académie, Tom. Vi. p. 6, 1774.) 


' 

| Note The two methods, 1.e, deducing the higher moments in succession by 
Pearson’s formula and deducing the higher moment directly by Romanovsky’s 
formula, were applied to obtain the fifth moment when 

n= 100, p= b> qg=% and += 10. 
The first gave 
fo = 75, Ms = — 288, wy = 2°156,743 and pw, = — 4122411 in succession. 
The second gave directly 
. "wy, = ~ 4122410. 

There was very little difference in the time or labour required to reach y,, 
using an arithmometer. But the former gave with no greater expenditure of 
labour, #2, #; and py, which also would generally be required, and the work would 
be very much longer by the second method, if all the moments were required. If, 
however, Romanovsky’s method were used to check the pu; of the first method it 
would at the same time check all the earlier moments, and this indicates a decided { 


field of usefulness for the formula. Eb. | 








PROBABILITY INTEGRALS FOR A HYPERGEOMETRIC 


SERIES. 
By ProrEssor BURTON H. CAMP, Pu.D. 


(1) IF a bag contains pn black balls and gn other balls, where p+ q =1, the 
probabilities of obtaining r, 7—1,..., 1, 0 black balls in one drawing of r balls at 


a time are given by the terms of the finite hypergeometric* series . 


yi) (1) —1 dis n—y)(n—y-] ) 
{ ee n—yYy — ( y y # oe 
n”) | li y-—r+1 2! (y—r4+1)(y-r4+2) ) 
where y= pn, n— y=qn, y=y(y—-1 )...(y—r+1), ete. The probability of obtain- 
ing at least s = 7 — t black balls is given by the sum of the first (¢+ 1) terms of (i), 
which may be written in the form 
. l ro) ri ) = 
Staly)= =) ly 7 gy") (a — y)O +... + a 9" yt pakared (11) 
=u(0)+u(1)+...+ u(€). 


It is important to find a ready method of calculating the sum in (ii). This sum 
is what is meant by the probability integral for the hypergeometric. It is easily 
seen that, in limiting ourselves to the consideration of the sum of the first (¢ + 1) 
terms, instead of including also the sum of the last (+ 1) terms, we are really not 
losing generality, for the first sum would become the second if the values of p and 
q were interchanged, and the sum of all the terms is unity. 


(2) The general plan and notation of this paper will follow fairly closely those 
of an earlier one, and we shall make use of certain of the results given there}. In 
particular; if A denote the tail of any frequency function /(£) such that f2 0 if 
a = & <b, and f= 0 elsewhere, and if ¢ 


SiS (€) AE=1, A = DiS (E) AE = {[f(a) +f (a t+ AE) +... +f (7)} AE (iii), 


~ 


then we have, using the symbol “=” to mean “is approximately equal to,” and 
Sheppard’s § notation, 


™ ania’ 
A=$(0) ic + EAv + 3( ay )\ 


* Cf. W. P. Elderton, Frequency Curves and Correlation, Layton, London, 1906, p. 37. 

+ B. H. Camp, “Probability Integrals for the Point Binomial,” Biometrika, Vol. xv1. pp. 163—171. 

+ The reader should note that, in the notation to be used, > is a sum which includes the values at 
the limits a and r. 

§ Tables for Statisticians and Biometricians, 2nd Edition, Part 1. 1924, pp. 2—11. 











62 Probability Integrals for a Hypergeometrie Series 


where, in the order suitable for computation, 


. Av Av 
v=T—£&, o(v)=/(r-»), Av=AE, ga! f r=\/@ otee) 


(0) p(2Av) 
c, =— log. QR, c.=log, R, = V2c,Av, « = co, 
, 3-2 5 — 10a? 4+ a4) ; 
k= 3 ( zy was . aoe ee Reeereine or (iv). 
Sometimes certain terms of (iv) may be neglected, and then 
A= 0Om EES DOE LET ER CARER (v). 


(3) Special Case (Q small). It is only when & is small that (v) suftices, but 
when the stump is far from the mode this often occurs. We may then apply (v) 
directly to (11) and obtain 

_» Ree ee (v1), 
1—Qit) 
where w(t) is the first term in the stump, viz., 


Siu (y) = 


+(t) 


1 oe 
u =n” (n—y)", and Q(t)= 


7 


t(y—s) 
is+1)(n—y-—t4+1)° 
Sometimes it is worth while computing a few terms of the tail before applying 
(vi). When three extra terms are computed, S,4,(y) takes the form 


l l l 1 
S, yy=u(t-—3 ' 

yore ce 1 =9@=t UGS * e¢-H OG=D 
* . (vi be 
> “Tryst eeeeee /1 OWS 
Q(t—2)° Q(t-1)° QS ) 

where 
jo-9-_c > oe * L  _(s + 8)(n—y—t+3) 


~ (8+ 4)(n— y—t4+4)’ Q(t—2) — (t—2)(y—s—2) iat 
(4) General Case. In order to be able to use for the general case methods 
analogous to those employed for the point binomial, it is necessary to show that 
an analogous transformation can be made of the sum S,4,. The result will not be, 
as in the case of the point binomial, an integral, but instead another sum. However, 
this new sum will be found to possess a certain advantage due to the fact that the 
point binomial integral is its limiting form. 
First, let us difference (11) with respect to y, taking Ay=1, and noting the 
formulae : 
A(n—y)% =—t(n —y-—1)*, 
Ay” (n—y)® =sy® (n— y) —t(y+1) (n-y—-1)™; 


1 »(2 3 
ASi (Y) = => J pyr + ; ~y"—) (n—y)) + = y 3) (n—y)? +... 
n ! 2 


+(t+1 
? + 


ri) ; re 
a ¥ vin —y)%— ay iy + 1 "a ~9—3)* — 7 (y+1)* (n-—y—-1)% -... 


»(t 
= @ pit D® @-y- Dr} hain (vii). 
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Now write (vii) successively for y=a, y=a+1,..., y=a+ 8, and add. The 
result is 
ASt4 (@) + AS, (a+ 1) 4+... + AS,4, (a +B) 
L nt) _goatp 
i a © eer (Aé = 1) 


= ni”) 


1 pit) ge. a pit) a ae ro) i - é=a+Btl 
n® |(@—1)! (n—&) +Eaapit (n—&) +...t or € (n—€&) se 
dewiceeaten (vill) 


In this let us now put a=s—1,a+8=y-—1. Then the left hand side of (viii) 
equals S,,,(y), for by (ii) Si4,(s—1)=0. On the right the expression in brackets 
vanishes at the lower limit €=s —1, for each term involves (s — 1) with an index 
greater than s — 1, omitting the trivial case where ¢=0. Thus we have 


pit) ~t=y-1 


/ l 
Stn (Y) = i wig om — G)" DE + O(y), 
where 


re ee ying PO wir (n — yy $e yaK 
y)=— < y) — vy) yer) (n — “) ety” —y)y” 
(y n™ \(t— yd (n—¥ @—2)14 iy ory n—y) 


In order to sum the series @(y), we may repeat the method just used. Differ- 
ence with respect to y, getting Aé (y). 

Then write 4@(y) for y=a,y=a+1,...y=a+8, and add, obtaining 

AO (a) + AP(a+1)4+...+ AO (a+ 8). 

If we then substitute a=s—1,a+8=y-—1, we obtain @(y), for 0(s —1)=0, 
as already noted. The details, though somewhat lengthy, offer no difficulty, and 
are omitted. We obtain, finally, 

L gir! 3 
O(y)=- 2 


' 


n') (t—1)! “f=s-1 


PEO (n— BO (AE=1) +6, (y), 
where 
ft we ee 
/ _ Ss a 
AY) = wm Xa (@—-1-1)!" 


Now 6,(y) is exactly like @(y), except for sign and certain changes in the 


8 (a — gy 9 (ae A) oc ccccs (x). 


indices. The result of continually repeating this process may therefore be inferred, 
and the final result is 


pletl) — pint (n—&)® (n—&)le : 
y a s ‘ (s—1) _ . 
Stan (9) =o 2 pee F(a Gop to +1) enced (xi). 
The series in (xi) is in the form 
Cg = ei ace Qs waecswsensiassentataneeeavecd (xii), 


where ,,C; is the number of combinations of m things taken ¢ at a time, ete. In 
order to sum it, we write Pascal’s theorem for ¢, ¢ — 1, etc., and add, as follows: 


Y Y Y 
nC _ =i t si m—1¢ t—1> 
' Y ’ 
*~ nf ta = — malt i - if t—2> 
Y Y Y 
nl t—2 = m—1Ut—-2 + m—il t—3> 
pM I, css swe. ceeasmumasmngesensebes enouuaaees (xlil). 
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Insert this result in (xi): 
ped —tay—1 : 
2eu9-1 6° (wn — E— 1) (QE =1) ......... (xiv). 


ng! § 


This is the formula for the hypergeometric which is analogous to (xiv) in my 


Siu (y= 


paper on the Point Binomial. It is of the form (11), and, if we now apply to it the 
approximation (iv), which, as in my previous paper, is expected to be good only 
when s and ¢ are large, we have as 
Meruop [| (Indirect Approwimation): Si, (y) = u(t) , Where u(#) is as in 
PI H LY y P 
(vi), and, in the order suitable for computation, 


(y—s)(n—y+1) / 9 (y —2)(n—y-t +2) 


Q(t) = , k= ; 
vO= Grinn-y—tely “"V SGre-la-yt?) 
I 
¢=—log, QR, c,=log, R, = V2c,, #v=c,, 
; er 1 —a,\ : 
Easin(iv), p=$+ $( ee id ee (xv). 


As in the case of the point binomial, it is necessary that c, 2 0, and this can 
always be secured by a proper statement of the problem initially. That is, if ¢, 
should ever come out negative, one would know that the tail of the hypergeometric 
as chosen had produced in (xiv) a function of & whose mode was within the limits 
of the summation indicated in that formula. The original problem should there- 
fore be restated so that the opposite end of the hypergeometric is selected’ as the 
tail, and then ¢, will be found to be positive. The mode of the function of & that 
occurs is easily found to be at the point &=n(s—1)/(r -—1)-1. 

Example 1. Find the sum of the first 46 terms of the hypergeometric for 
which n= 360, r= 90, p=4. 

75) (241) Soar 
Here y=120, t=45, n—- y= 240, Q= ( A = ‘774 9528, 

‘ : (196) (119) 
/_ (118) (197) 
R=,/Q— c, = 0'251 9864, c,=0°002 9662 102, 

eM (74) (242)? ” om 

l -- ~ _ ’ 

= 0°077 0222, #=3:271 607, H=0°020 983, p= 41920, 


a 
S=1'5720, w(t) =0°0001 1553: 

and the true value of this sum, found by computing and adding the terms, is 

00001 1525. 

(5) Mernop II (Direct Approwimation): As in the case of the point binomial 
one may approximate to the sum of the first (¢+1) terms of a hypergeometric 
by applying the formulae of Section 2 (iv) directly to the series, The formulae 
resulting from this process will now be given, and then, after a few illustrative 
examples, the relative merits of the two methods will be discussed : 

” t(y—s) R= /g @+2)(n-y~-t+ 2) 
(s+1)(n-—y—t+4+1)’ "i (t—1)(y—s-—1) 


5 (Cay @y Cy dey GRO P OB RY COW). sacecscses (xvi). 


Sia = u (6), 


() 
u 


r 





Burton H. Camp 
“eample 2. Find the sum in Example 1 by (xvi). We obtain 


¢, = 0951 358, c,=0-031 2456, —= 0249 982, «=380570, E= 0-078 341, 


Qi 


p = 156828, S= 0-000 115 253, and the true value is 0000 115 254. 


(6) The above examples have been worked with all the precision necessary to 
exhibit the closeness of approximation of which the formulae are capable. This has 
involved the use of inconvenient tables* to find $ (1 —a)/z, and the necessity of 
finding the constants to several decimal places. For many purposes, however, as 
great accuracy as that obtained, for example, by (xvi), is not required. What is 
especially needed is a method which will give results in error by at most one or 
two per cent., and which will involve a minimum of computation. This last con- 
dition is very important, for we are here considering approximate formulae, and one 
always has the option of discarding the formulae and computing the several terms 
of the series outright. The approximate formulae, therefore, must not be too 
cumbersome to employ readily. Both of the methods given above admit of some 
modification, which will now be noted, and the succeeding examples will illustrate 
the accuracy which may be obtained with a minimum of effort. 


First write, in both cases, HW = ¢,/12, and so: 


5 s , l — a = 
Indirect Method IT: Sys, = * u(t) } et “1 ts = «)| is utd AS See (xvii), 
)2 ; 


i ae ee 


¢  o (l—a,)) 
5) a { 


_ ~ xn 


Direct Method IT: Si4, =u(t) 1 + Kaipiticesiesseasaniem (xviii). 
( 


In both cases, find Q and R? by a computing machine, and then log, @ and 
log, R? from a five place table of natural+ logarithms. Obtain o to four significant 
figures by the use of tables of reciprocals and square roots, and find the ratio 
4 (1 —a)/z from Sheppard’s tablet. 


Example 3. Do Example 1 as just indicated. By Method I, S=0°000 116 ; by 
Method II, S=0°000 115; the true value is 0:000 115. 


Example 4. Let n=900, 7 = 450, p=}, t= 120. By Method I, S = 0-000 0145 ; 
by Method II, S=0°000 0144 the true value is 0°000 0145. 


Kxample 5. Let n= 900, 7=450, p=3t= 110. By Method I, S= 1-00 x 107°; 
by Method II, S=1:01 x 10-*; the true value is 1°02 x LO-*. 

Example 6. Let n = 240, r=90, p=4, t= 45. By Method 1, S=0:0000 2285: 
by Method II, S=0-0000 2282; the true value is 00000 2282. 


Example 7. Let n=360, r= 90, p=t, t= 1m By Method I, S=0:002 783; 


‘ — 


by Method II, S = 0:002 756; the true value is 0°002 754. 


* Burgess, J., Transactions of the Royal Society of Edinburgh, 1897, Vol. xxx1x. pp. 257—321. 
} + E.g., B. O. Pierce, A Short Table of Integrals, 2nd Ed., 1910, Ginn and Co., Boston, p. 130. 
t{ Unfortunately, this table is so arranged that the ratio is given poorly in the range, 4S x<4-50. 
A tabulation of the ratio itself would be very helpful here, as with the point binomial approximation. 
Such a table, accurate to five significant figures, is now in course of preparation. 


Biometrika xv 
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Example 8. Let n=900, r=90, p=4, t=72. By Method I, S=0-006 535; 
by Method II, S= 0-006 428; the true value is 0°006 428. 

Example 9. Let n= 900, r= 90, p= 4, t=45. By Method I, S=0:000 422; 
by Method II, S=0000 414; the true value is 0:000 414. 


A critical survey of these examples will indicate that the direct method (IT) 
gives better results. There is no essential difference between the two methods as 
regards difficulty of computation. Is there, therefore, justification for the develop- 
ment of the indirect method? This question may be answered after a brief consider- 
ation of the point binomial case. There the indirect method was preferred, not 
because it gave closer results—indeed, in the published examples it gave less close 
results—but because it gave results which were close enough, and it involved less 
computation. It was, for large values of 7 (x in that paper), the more efficient 
approximation, efticiency being understood to depend partly on the closeness of the 
approximation, and partly on the simplicity of the formulae. Now, if we let 
become infinite in our present indirect formulae, they will approach as limits the 
corresponding indirect formulae for the point binomial; the sum (xiv) of this paper 
will approach the integral (xiv) of that one. Therefore, in border line cases, that 
is, where n/r is large, it is to be expected, and it is in fact true that, without 
serious loss of accuracy, the formulae of this method may be simplified, so as to 
render them analogous to the point binomial formulae, and so as to make them 
easier to use in computing. On the other hand, although the direct method formulae 
are susceptible of an analogous transformation, going over into the direct method 
formulae for the point binomial, they do not result in a corresponding simplification. 
The simplified formulae will now be given and illustrated. 


(7) Use of Indirect Method when n/r is lurge. As before, use tables of reci- 


procals and squares and find the following constants accurately to four significant 
figures : 


s—] t | jo — §\3 / t 2 
Q = - res +41 (- ) + |. 
y-l n-y-t+l L\y—1/ n-—y—t+l | 
i -—-1l l l ; l bas 
o y-1 n—-y-t+l n-yt+2 y-s-l’ pi de 
= ; 8 l—a, : 
Chen Sey, =u (€) (1 { nd =) Sone ake eeweeennue urewees (X1X). 
ey  S- Se 


Example 10. Use the above method in Example 9. We obtain n/r = 10, 
S = 0°000 424; by I, Ex. 9, S=0-000 422: the true value is 0000 414. Thus there 
is no essential difference in the approximation given by Method I when these 
simpler formulae for the constants are used. Since r is only 90, neither result. is 
very close to the true value. As with the point binomial, the success of the method 
depends on r being large *. This method is especially valuable when r is much 
larger, as in the following case. 


* I have intentionally included in both my papers examples where the methods do not apply very 
well, in order that the reader may have some idea of the range of their usefulness. 
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Example 11. Let n= 4500, r= 450, p=2, t= 120. Here also n/r = 10. 
S=0°000 7965 ; the true value is 0:000 7980. The proportional error is only about 
0°19 of one per cent., one-twelfth as large as that in Example 10. Also, the 
last place is in some doubt, due to the inadequacy of the tables for the ratio, 
3 (1 —a)/z. 


(8) Summary. In order that the reader may have the benefit of such experi- 


ggestions 


ence as the author has had in computing from the above formulae, some sug; 
as to when the various formulae should be used will now be given. These suggestions 
are admittedly both tentative and proximate, for the experience on which they are 
based is not exhaustive, and because, when the accuracy of a formula depends on 
four variables (n, 1, p, 8), it is seldom possible to frame a simple, ri 


gorous rule of 


procedure. 

(a) If greater accuracy is desired than can be obtained by the abbreviated 
methods of Section (6), it is probably not worth while using the formulae at all 
unless 7 is at least as large as 300. This would not be true if better auxiliary 
tables were available*, but in our present state it is probably better to compute the 
various terms of the tail by successive multiplications, beginning with the term at 
the stump. 


(b) If pqg> yh, and r< 400, use Method IT of Section (6). If pq > #5 and r > 400, 
use Method II of Section (6) for values of » < 10r, Method I as modified in Section 
(7) for higher values of n. The proportional errors will probably not exceed one- or 
two-tenths of one per cent. in the extreme cases, and will usually be much smaller. 
They grow small as pg and r are increased. 

(c) Formula (vi bis), or even (vi), is always to be used, however, when the Q of 
Method II, which is the same as the Q of (vi), is small, say less than one-third. 
A case in which Q is a little greater than one-third (0°3743) is given in Examples 
2 and 3. The error in S as given by the method of Example 2 is 0-000 000 001 ; 
by II of Example 3 it is 0°000 000 02; by (vi bes) it is 0:000 000 19; and by (vi) 
it is 0°000 002 26. The proportional error by (vi bis) is 0°0017, and by (vi) is 0°02. 


Finally, it is a pleasure to record my indebtedness to Professor Karl Pearson for 


proposing the problem of this paper and for helpful suggestions made during the 


course of its preparation. 


Cf. footnote ¢ on p. 65 above. 








DETERMINATION OF cos"! 9. d0 FOR LARGE VALUES OF 1, 

J0 
AND ITS APPLICATION TO THE PROBABILITY INTEGRAL } 
OF SYMMETRICAL FREQUENCY CURVES. 


By JOHN WISHART, M.A., B.Sc. 


THE Complete Integral cos"! @d6 has been tabled from n =— 1 to 103*,. The 
. vO 


Incomplete Integral 


ro rx , : . f 
cos"! 6d@ = $ (l—wy"a-*dx on putting #=sin’é 
~ 0 “0 
= 4B,(4n +1, 4), ..... Liooreniaibenhanabingschuuenliniren (1) 
and so can be found from Tables of the Incomplete B-function, which are in ? 


progress. But such tables must of necessity be limited, stopping, say, at p= q= 50. 

It still remains a problem then to determine the integral under consideration for 

large values of n. It is the object of this paper to show how our function can be 

evaluated to a very great degree of accuracy by expanding in Incomplete Normal 

Moment functions. The method was first suggested by Professor Pearson as a 

means of evaluating the incomplete ['- and B-functionst, but it was found after 

tables of the moment functions had been made that it was not accurate enough f 
when » was small. Illustrations of the method ir the case of the Incomplete 

B-function appear in T'racts for Computers, No. vu, but the accuracy there 

achieved leaves something to be desired. 


As will be shown later, the curve of which (1) is the integral is one of the well- 


known Pearson frequency types, and hence what is generally required is the ratio 7 
of the incomplete to the complete integral. 
oa 
cos”*! 6dé@ 
Let 4 + Ig(n+ 1) = : 
: Cl s"41 Ode 
cos”*! 6dé@ 
Then Dg (10 1) = a $—_ rerecececcccceccrereeeen (2) 
2 “cos"t! 6d0 
Jo 


* : 
(l—y*)" dy 


- : 
2} -yy"dy 
“0 
on putting y= sin 0. 


* Biometrika, Vol. x1. p. 377 + Biometrika, Vol. v1. pp. 66—8. 
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yn (1 os y pa dy =}. I (m) l (p) 


In this put m=} and p=4n+1. 


(1—°) dy. 


D(dnyU {E(rn4+1)} = 


and we shal! expand the [-functions by Stirling’s theorem. Thus: 


sais wat Vin. im the integral, Teeras in — oud beer powen ace eauletael 
on putting «=Vn.y in the integral, Lerms in — and higher powers are neglected, 


o. Ig(n + 1I)=(1 +a 


Now the rth Incomplete Normal Moment function is 


l " # 1 2 
M,(2)=— ve * dx | (v - L)(v—3)...... y 
V2er Jo 


and so in terms of these 


/ 3 2 
[g(n+1)=(1+, - sa) Mm, (2) 


Q \ b 
Dr + 128), :) m,(a“)— ( 


Ne (a) + Myy (2) — 
: Si! 


2 My (L) — Cy. 4(L) — Cg. Mg(2) + Crp - Myo (HL) — Cyo. Myo(H) ...... (6), 


91875  :0703125 
: + 








x a6 8 


| ~ 4n G2 Bn dx, 


yo on 
+ _ 2. 
24n? = =384an | da 


or i 


/ 
9 
. 


465 = 
“HRs Mas (@) «-(5) 
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In this expression #=Vnsin@. m,(«) is got from the table of 3 (1+ a)* by 
subtracting *5, and the functions m, (#) from + = 1 to 10 are tabled to seven figures*. 
A supplementary table giving m,.(#) appears below. It was calculated by integrating 
twice by parts and combining the results, which gives the relation 

My (xv) = 4 {mg (x) + My (x)} — ae (4 o oI 2x 
ait * 1890 \ ny 
where z, is the ordinate of the normal curve, given by Sheppard’s tables. In a few 
cases where the last figure was in doubt, the integration was carried further, and 
the function expressed in terms of m,(#), known to more places than seven. 
Second differences only are necessary for interpolation into this table, for although 
the third differences are considerable, none is greater than ‘0006, and this, when 
combined with the appropriate coefficient, is insignificant to the order of accuracy 
we desire. 


This result (equation (6)) enables the function J,(n +1) to be calculated with 
great rapidity when once the coefficients ¢, ete. are known, and provided that n is 
reasonably large. Table I gives the values of these coefficients for values of n from 
100 to 400, proceeding by twos, i.e. for values which would come outside the Tables 
of the Incomplete B-function. For still higher values of n, ¢, ¢,, ¢; and cs can be 


: ‘ , i's “ 
calculated from (7), dropping the terms in —,. It is found that for n above 400, the 
* 


terms in — are insignificant in the seventh place, and to a first approximation 
nv” 


. 34 3 5 
y 


I bined tx \ () + 9 5 (x) L105 
+1)= ~ } m, (a) — | x) — (a 
aie \ 4n- 32n?/ ~~ An aa) m4 2n? ARNT 


> . Mg(v)...(8). 
32n7 ( ) 
Now the equation to the symmetrical frequency curve of limited range 


(Pearson, Type [Iu) is 
2B ant (1 = a "+ \ 
«?/ 


De ° ud ° 
If in this we put y =— and m= $n it becomes 
. a 


° 


$n 
¢=2,(1 —y)", 
and hence the probability integral of this type of curve is 


ry ; 
(l-y)" dy 
= - =44+1,(n+ 1), 
(1—y*)" dy ; 
J oil 
from (3). 
So that the function determined by equation (6) is the probability integral of § 
the Type Ila curve, when added to 3. The diazram illustrates the form the curve 
Tables for Statisticians, pp. 2—8 and 22—3. 
+ The type Zo (1 ~~) , reduces to the same integral form if we take z/a=tan 6; no further 
as, 


probability integral is required. 
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takes for large values of n, and the probability integral is the ratio of the shaded 


portion of the area to the whole area, The examples which will now be given in 


Node 






Terr Cee VUGVTCUYT VC Geet 





PTUTETITTE 








— 0 Sire8 +1 


illustration of the theory will show that the approximation (6) determines Ig (n + 1) 
to seven figure accuracy up to a distance from the mode (taken as origin) of three 
times the standard deviation. This is very nearly the whole area. The value of the 
“tail” is given by } — J,(n +1). 


Example I. To evaluate J, (101). 


(a) Let us integrate from the mode to @= sin '3. 


Then « = Vnsin@=3. We have at once from equation (6) and using Table | 
[.,(101) = 1:0074 782 m,(3) — 0075 561 m,(5) 
— ‘0002 519 m,(3) +0003 175 m, (3) 
+ ‘0000 394 m,, (3) — 0000 271 m,(3) 


= 5023 7911] — ‘0033 6600 
8933 9413 

745 308 

*5024 7589 0034 6321 


34 6321 
‘4990 1268 


I. (101) = -4990 127 to seven places. 








oJ 
bo 
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As a check on this result, a value, correct to at least seven figures, was obtained 
from 16 terms of the “ parts” reduction formula. ‘Since 


} 
! 
| 
j 
| 


, taken from the ) 


[ fal — yy)" dy = } + y)" dl = y)" dy 


B, (51, 51) 
B (51, 51) 


mode as origin, and when we remember that our range is + 1 from the mode and 


the function we are evaluating is seen to be equal to 


the integral of the “ parts” reduction formula is measured from the start of the 
curve, we see that 

1 B..,(51, 51) 
I.,(101) = = — 

2 B(51,51) 


l PO 507 {,,49 Tf f,, 86 7) \) 
=-— B5)) (65) (1+ sa. 41 + sexe jee eee <1 — e gee ae 
2~ 1(52) riser “6°”? ( + 52°18) * 8°18 1") * 66 Isis", 
the Remainder Integral being insignificant to this order of accuracy, 
(102 
a = (192) -g5)n (65)" x 1:9984 3246. 


2 F662) F651) 


This expression was evaluated using Tables of the Logarithms of the Gamma- 
Function* and Briggs’ Logarithms to 12 places. 
We have 159-974 325 0285 
24747 470 2618 
10°645 667 8322 
0300 689 4751 
127°668 152 5976 
130°673 719 9211 
4994 452 6765 
I.,(101) = $ — 0009 8726 = °4990 127 to seven places, 
which shows that our value by moment functions is correct to seven figures. A 
further attempt was made towards evaluating the integral, using Weddle’s Rule ; 
(24 ordinates). This gave a value *4990 128. 
“6 


The actual value of cos" @d@, where 0=sin—'3, is, of course, obtained by 
0 


multiplying the value found for Z,(101) by 2 | ~ cos™ @d0, which is tabled. 


(b) Let us now integrate over a smaller range, say to @=sin7'2. Then 


K 2 9 


w=Nn =2. The coefticients are the same as before, but we now require to look 


out the moment coefficients of «=2. We find numerically: 


+ ‘4808 1884 — 0017 0233 
1406 2774 

59 12 

‘4808 3349 ‘OO1LT 3019 


L7 3019 
7 ‘4791 0330 
P Ecol 101) =°4791 033 to seven places, 


x 
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which is again found to be identical to this number of places with the result 
deduced by parts. 

Example II. Consider now the case of a higher power. 

Let it be proposed to find [,(401). Now the point of inflection of the curve is 
given by sn@=y=_, , So for the case considered in Example I the points of 

= 

inflection are +1 from the mode, and in this example +°05. Sometimes an 
approximate formula is found to be unreliable in the neighbourhood of a point of 
inflection. To examine this difficulty, let us take 

(a) Integration to @= sin 05, i.e. to a distance from the mode a little greater 
than the standard deviation, o = ‘0498. 


Then t= Vn sin 6 —) 
Zquation (8) may be used, the extra terms of (5) being found to be of no 
significance in the seventh place. We then have 
I .;(401) = 1:0018 736 m,(1) —-0018 785 m,(1) 
— ‘0000 156 m,{1) + 0000 205 m, (1) 
= ‘3419 8430 — ‘0000 3516 
3520 4 
‘3419 491 to seven places, 
the result to this number of places being, in fact, got from the first two terms only. 
T'wo checks were here employed. Weddle’s Rule, applied to 24 ordinates, gave 
the result *3419 491. Then the integral was expanded and integrated term by 
term, as follows: 


l / l : v) ry Ly 
From (4) Jg(n+1)= (1 + ) " d-yy dy 


. {y. lo 
ai n} \T(4n)} Jo 


_ 2005 1(400) 5) — 200 05Y 200.199 05) 
"= Fe00) ee ats 


We need seven terms of the series for seven figure accuracy. 


(400) 


, Pad ° ew RVF 9 
I ..; (401) = Fam (200) * O857 675 241. 
We have 866°204 354 1864 745°191 717 2888 
2:933 322 8727 120°411 998 2656 


865'137 677 0591 
$65°603 715 5544 
1533 961 5047 
*. I.5(401) = 3419 491 to seven places, 


so that all three methods yield, to this number of places, identical results. 
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(b) To integrate now over the range 3c. 
Let @=sin-'*144. .*. a=20 x 144 = 2°88. 


The values of the moment functions of 2°88 were obtained by interpolation, 
using Everett’s Central Difference Formula 


Zo = 2, + Oz, — 10g {(h + 1) Se, + (8 + 1) &z,}, d=8, d=°2. 
Consideration of the fourth differences throughout the table of moment functions 
shows us that, though in some cases they are considerable, the next term in 
Everett's Formula, taken in conjunction with the maximum coefficient of Table I, 
is negligible to our order of accuracy. 
We then have 


I .44,(401) = 10018 736 x -4980 11624 
— "0018 785 x 4296 294 
— ‘0000 156 x 3463 230 
+ 0000 205 x ‘2476 137 
= ‘4981 373 to seven places. 
The value found by Weddle’s Rule was also 4981 373, while integration term 
by term, a much more lengthy process this time, since 20 terms are required for 


seven figure accuracy, gave 


1'(400) 


T .44,(401) = 
iii 2 51"(200) 


x "1249 425 775 


Conclusion. The foregoing examples prove fairly conclusively that results, 
correct to seven places, can be obtained by expanding the function (2) in terms of 
the tabled seven figure moment functions. The first example would normally be 
obtained from a Table of the Incomplete B-function, when available, but it was 
selected as a case of extremely low indices. It is clear that the larger x is, 
the easier does the computation become, for then equation (5) reduces to the com- 
paratively simple form of (8). 
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TABLE I. 


Values of the Coefficients. 


75 





1 


1 
l 
l 


— ee el ee 
_ 


“0074782 
‘0073320 
‘0071914 
“O070561 
‘0069257 
“OOG8002 


‘0066790 
‘0065622 
‘0064493 
‘0063403 
‘0062348 


“0061329 
‘0060342 
“0059386 
‘0058461 
*0057563 
"0056693 
“0055849 
"0055029 
0054233 
*0053460 
"0052709 
‘0051978 
“0051267 
*0050576 
‘0049903 


*0049248 
“0048609 
‘0047987 
‘0047381 
*0046790 


0046213 
‘0045651 
0045101 
*0044566 
*0044042 


°0043531 
*0043031 
0042543 
0042066 
0041599 


0041143 
‘0040696 
‘0040259 
"0039832 
0039413 
‘0039003 
‘0038602 
‘0038208 
‘0037823 
"0037445 + 


‘0075561 
74069 
72634 
71254 
69925 + 
68645 + 
67412 
66221 
65072 
63962 
62890 
61852 
60849 
59877 
58936 
58024 
57140 
56283 
55451 
54643 
53858 
53095 + 
52354 
51633 
50932 
50250 — 
19585 4 
48938 
48308 
47693 
47094 


+ 


16510 
45940 
45384 
14842 
14312 
13794 
13289 
12795 — 
42312 
41840 
11378 
40927 
40485 — 
410053 
39629 
39215 —- 
38809 
38412 
38022 
37640 


*0002519 


2080 


2006 
1936 
1870 
1807 
1747 
1690 
1636 


1087 
1059 
1032 
1006 
981 
957 
934 
911 
890 
869 


0003175 — 


3053 


Oo Ww © 





1040 
1017 
994 


973 


952 


10 


*0000394 
371 


“0000271 


350+ 


331 
313 
296 
280 
266 
252 
240 
228 
217 
207 
197 
188 


255+ 
241 
227 
215 — 
203 


193 
183 
173 
165 - 
157 
149 
142 
135+ 
129 
123 
118 
113 
108 
103 
99 


95 — | 
91 | 








TABLE I (continued). 


Values vf the Coefficients. 
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L*0037075 + 
10036712 
10036356 
1 0036007 
1°0035665 
1°0035329 
1°0034999 
1°0034675 + 
1°0034358 
1°0034046 
‘0033739 
0033439 
0033143 
"0032853 


0032567 


*0032287 
‘003201 1 
*0031740 
*0031474 
“0031212 


1°0030954 
10030701 
10030452 
1 ‘0030206 
10029965 


1 °0029727 
1°0029494 
10029264 
1°0029037 
10028814 
“0028594 
"0028378 


"0028165 





"0027544 
0027343 
"0027145 
"0026950 
“0026758 


+ 


1 0026568 
10026381 
10026197 
1°0026015 4 
10025836 
1°0025659 
1-0025485 
l 





2 
o 
2 
° 


10024976 


‘0037266 
36900 
36540 
2G1R8 
35842 
35502 
35169 


30832 
30581 
30333 


30090 


29850 
29615 
29383 
29154 


28929 





26292 
26109 


20929 


25751 


25h75 - 





+ 


c 


“0000615 — 





6 ~ 


603 776 
591 761 
580 747 
569 733 
719 
706 
693 
680 
668 
09 656 
YOO 644 
1] joed 
{82 622 
174 612 
166 601 
458 591 
150 + 81 
443 571 
435 + 62 
428 553 
12] 544 

$14 535 + 
108 527 
101 518 
395 510 
389 502 
383 194 
377 187 
371 179 
72 

165 — 
158 
151 

15— 
138 
132 
126 
120 
$14 
108 
! 102 
306 397 
302 391 
298 386 
381 

375+ 
370 

365 + 





Cho 


“0000048 
16 
45 
44 








» 


5 + 


“0000033 
32 
3 
30 


29 


28 


be bo bt te 


wwe © 


bo 
tbo 


te te te 


bo Ww 


mr bo bo bo 
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TABLE I (continued). 


Values of the Coefficients. 








n Co Cy Cy Cg Cry Che 
302 1:0024810 “0024896 ‘0000275 ‘0000356 “0000014 “0000010 
Yi 1 0024647 24732 271 351 14 10 
6 10024486 24570 268 347 14 9 
8 10024328 24410 264 342 13 9 
310 1°0024171 24252 261 338 13 9 
312 1°0624016 24096 257 334 13 9 
Y 10023863 23942 254 329 13 9 
6 1°0023712 23790 251 325 12 9 
8 1°0023563 23640 248 321 12 8 
320 1°0023416 23492 245 317 12 8 
322 | 0023271 23346 242 313 12 8 
j 1:0023127 23202 239 309 12 8 
6 1 0022986 23059 236 306 1] 8 
8 10022846 22918 233 302 Ll 8 
330 1 0022707 12779 230 298 1] 8 
32 1°0022571 22641 227 295 — 11 7 
Y 10022435 + 22505 + 225 - 291 11 7 
6 1°0022302 22371 222 288 LO  f 
8 1°0022170 22239 219 284 10 | 
wD 10022040 22107 217 281 10 7 
4.2 1°0021911 21978 214 10 7 
y 10021784 21850 — 212 10 7 
6 1-0021658 21723 209 72 10 7 
8 1°0021534 21598 207 268 9 6 
350 1:0021411 21474 205 65 9 6 
352 10021289 21352 202 262 9 6 
Y 1-0021169 21231 200 9 6 
6 10021050 + 21112 198 9 6 
8 1 0020933 20994 195 9 6 
360 10020816 20877 193 8 6 
362 1 0020702 20761 191 8 6 
4 1 0020588 20647 189 8 6 
6 10020475 + 20534 187 8 6 
8 10020364 20422 185 8 5+ 
S70 1°0020254 20311 183 8 5+ 
372 1°0020145 20202 181 235 + 8 + 
l 1:0020038 20094 179 233 8 5+ 
6 10019931 19987 177 230 7 a+ 
8 10019826 19881 175 228 7 5+ 
380 1°0019722 19776 173 225+ 5 5— 
382 10019619 19672 172 223 az 5- 
l 1:0019516 19569 170 221 7 o— 
6 1°0019415 + 19468 168 218 7 5 
8 1°0019315 + 19367 166 216 7 a— 
390 1°0019216 19268 165 214 7 5- 
392 10019118 19169 163 212 7 } 
) 10019021 19072 161 210 6 4 
6 10018925 4 18975 4+ 160 208 6 1 
8 10018830 18880 158 205 6 4 
400 10018736 18785 + 157 203 6 1 








. + ’ °@ 
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TABLE IL. 


Values of the 12th Incomplete Normal Moment Function. 


x 


a 


1° 
1° 
1° 
1° 
1° 





My (x) 6 


“OOOO0000 
“0000000 
*OOOO000 
“0000000 
*QOO0000 


“OOOO000 
*QOOQO0000 
*OO00000 


000000 1 + 4 
“0000006 7 
“0000018 30 
*OO00060 68 
*OOOOLTO 152 
*C000432 315 
*O001L009 601 
‘0002187 1072 
‘0004437 L799 
‘0008486 2860 
‘0015395 — 1312 
‘0026616 6206 
0044043 8544 
‘0070014 11274 
‘0107259 14287 
‘0158791 17413 
*0227736 20426 


My (@) 


‘0317107 
‘0429539 
‘0567028 
‘0730674 
-0920490 


*1135283 
*1372638 
"1628998 
*1899840 
*2179947 


*2463711 
*2745471 
*3019840 
*B281988 
*3527866 
3754351 
*3959320 
°4141636 
"4301077 
"4438212 


*4554252 


*4650882 


*4730086 


*4794006 
*4844809 
"4884582 
*HOOOOOVO 


+ 23061 
25057 
26157 
26170 
24977 


22562 
19005 
14482 
9265 
3657 


— 2004 
7391 
1222] 
16270 
19393 


21516 
22653 
22875 
22306 
21095 


19410 
17426 
15284 
13117 
11030 


9097 


































ON THE MOMENTS OF THE DISTRIBUTION OF SQUARED 
STANDARD-DEVIATIONS FOR SAMPLES OF NV DRAWN 
| FROM AN INDEFINITELY LARGE POPULATION. 


By A. E. R. CHURCH, M.A., B.Se. 


(1) BEING engaged in an experimental investigation of the moments of the 
distributions of means and squared standard-deviations (o*) of samples of V drawn 
from a definitely skew population I have had occasion to resort to the formulae for 
the moments of the distribution of o? for samples of V drawn from an indefinitely 
large population given by Prof. Al. A. Tchouproff in Biometrika, Vol. x11. p. 193 (23) 
and p. 194 (24) and (25). 

Considerable time was spent in an attempt to apply these to the actual distri- 
butions used, and finally Prof. Karl Pearson, under whose direction this experimental 
work is being carried out, came to the definite conclusion that the formula (25) for 
the 4th moment was in error, I read through the algebraic manipulation which 
leads up to these numerical results but was unable to follow its final stages where 
the statement is very exiguous. 

Now, previously to Tchouproff, “Student” had given the first and second of 
these moments in Biometrika, Vol. vi. p. 3, although unfortunately, the second 


I moment contained by a printer’s failure slight errors corrected, however, in the 
| following line. “Student” also gave the 3rd and 4th moments by the same process 
in the case of samples from a normal distribution. 

In the present paper I have extended “Student's” method to obtain the 3rd 
| and 4th moments for samples from any population so that I might have a revision 
of Tchouproft’s 4th moment by an independent method. I further desired to curtail, 
| if possible, the somewhat extensive use of symbols employed in Prof. Tchouproff’s 

memoir, which discourages the English statistical student. 

(ii) Let », be the tth moment of the original indefinitely large population of a 

variable X, where X is measured from the mean of this population, 

Let ,M, be the tth moment, about X = 0, of the distribution of o* of samples of 

N drawn from this population. 

Let ,J/, be the tth moment, about its own mean, of this distribution of o°. 

Then en % aa {SX)\?]F 
} . N ee Ss 


where S refers to the summation over the NV values X,, X.,... Xy of the variable 
X which form one sample, and where, since the original population is indefinitely 
large, these V values of X may be considered independent or uncorrelated ; while 
by = is meant the process of summing for all possible samples of WV and dividing 
by the number of such samples. 
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To obtain the first four moments ,M,’, .M,',,M;, .M, it is necessary to expand 
a number of expressions of the form {S(X,)*}” {S(X,)}*, where p and q are zero or 
positive integers less than 8, in series of sums like S(X,*X/... X,*), where 
a, 8,... and r are positive integers or zero, each series of sums being isobaric in 
the X’s of weight m, where m = 2, 4, 6, or 8. 

Thus I find 


{(S(Xy)P= S(X")+28(X2X-), 
(S(Xy)s= S(X,°)+3S8 (XX) +68 (XL2NZX), 
{(S(Xy)}t= S(X8)+48(X°X7) + 12 8 (XAX2N,7) + 6 SCX AN.) 


248 (X2X2X2X,), 
{S(X,)?= S(X2)+2S8(X_X,), 
{S(X, )i4 


lI 


S(X,4)+4S(X¥2X.) + 6 S(X,2X,") + 12 S (X,2N.,X;) 
24S (X,X.X,X,), 
(S(X,)*= S(X,)+6S(X5X,.)+15 8 (X¥' X27) +30 S(XAX,X;) 
+ 20 S(X,3X,3) + 60 S (X,°X2X,) + 1208S (X7.X,X,;X,) 
+90 S(X,2X.2X,7) +180 S(X,7X.2.X;X,) + 360 8S (X2X,.X,X,X;) 
+720 S(X,X,X,X,X;X,), 
{S(X,)}§= S(XS)4+8S(X,7X,) + 28 8 (X°X2) + 568 (XNX_X;) 
+ 56S (X5°X.*) + 168 S(X5X.2X,) + 336 S (X5X,X,X,) 
+ 70S (X,*X,.*) + 280 S (X,4X.2X,) + 420 S (X 4X.2X,?) 
+ 840 S (X1X.2X,X,.) +1680 S (XAX,X,X,X;) 
+ 560 S(X 2X3X,7)4+11208S(X3X2X;X,)+ 16808 (X 2X2X.2X ,) 
t 3360 S(X3X2X,N,X;) +6720 S(XPX,X,X,X,X,) 
+ 2520 S (X 2X 2X,2X 2) + 5040 S (X2X27NX27N,X;) 
+ 10080 S(X 2X2NX,NX,N,X,) + 20160 S (X2NLN,X,N,X,X;) 
+ 40320 S(X,X.X,X,X,;X,X,X;), 

(S(XPrIS(X)P= S(X,°)4+28(X2X.) +38 (XAX2) +28 (XAX,X;) 
+45 (X3X.3) +45 (X3X2X;,)+ 6S (X2X2X,7) 
+45 (X2X2X,X,), 

(S(Xy)PIS(X,)}= S(XS)+4S8(X,7X,) +8 8 (XX) + 12 8S (XXX) 
+128 (X,°X.*) +20 S(X)X2X,) + 248 (XX. X,X,) 
+149 (X,5X.") + 32 S (X,1X.2X,2) +28 S (X1X.2X,) 

+ 36 S(X'X2N,X,)+ 24S (X1X,X,X,X;) +408 (X; 

56 S (X,°X.2X.2X,) + 48 S(X2X3X,X,)+48 8 (X 2X? 


f 


X2X,7) 
AX yX4A5) 


+ 
+72 S(XPXZX2X 2)+ 72 8S (XPXZXZX,X;) 


48 S(X2X2X,X,X5X¢), 


(S(XY)P{S(X,)2= S(X"%)+2S8(X7X,)4+4S( XX) +28 (XX, X;) 
+6S(X5Xf)4+ 6S (XPXLAN,) +68 (XX) + 6 S(XAXEY;,) 
+128 (X{X2X3)+6 S(X'X2X,X,)+12 8 (X,7X2X,') 
+12S(X2X27X7X,) + 248 (X2X3X2X 2) 
+129 (X,2X2X2X,X;), 
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(S(XY} {S(X)jt= S(X)+28(X2X,) + 29(X2NX2) 42 9(X2X.X,), 


(S(Xy)} {S(X)#= S(X) +48 (XX, +7 8 (XX ye) + 12.8 (XX, X,) 
+88 (XeX8) +16 S(XeX2X,) + 24.9 (XPXX,X,) 
+18 8(X2X 2X2) + 24.8 (X2XEX,X,) +248 (XXX, X,X,), 


{S(Xy)} {S(X,)}*= SCX")+6S(X7X.) + 16 8S (X,5X.2) + 30 8 (X°X,X,) 
+ 26 S(X,°X,*) + 66 S(X°X2X,) + 120 S(X,5X.X,X,) 
+30 S(X,4X,*) + 90 S (X,AX.2X,) + 120 S (X,4X.2X,2) 
+210 S(XtX2X,X,)+ 360 S(XAX,X,NX,X,)4+140S8(X2X2X,7) 
+ 300 S(X?X2X2X,) + 240 S(X2X3X,X,) 
+ 480 S(X3X2X,X,X;) + 7208 (X2X,X,X,V,Xe) 
+360 S(X2X2X2X 2) + 540 S(X2N2NX 2X, X,) 
+ 1208 (AVAGAA Ag Ag) + (208 (4A G32, XAG, 4). 


Now if a, 8, y,... be all different, S(X,7X¥...X,4) has N(N—-1)...(N—r+4+1) 
terms in its expansion; but if p indices be alike of one kind a, and q be alike of 
N(N-1)...(V-r +1) 


terms in its expansion : 
Pp : q 3 cece 


. SS(AswX... XA)=N(N-1)...(N—r4+1)> (X°X¥... X/), 
1(N—1)...(N-—r4+]l) wapviy — : 
or <¢ a “= >> XaX eee Xt XP... XP 


another kind £, ete., it has 


ptg-**s» 
as the case may be, where >’ refers to the summation over all values of the variate 
X in the original population, divided by the number of such values of course. 


Now since X,, X,,... Xy are independent or uncorrelated we have therefore 
> S(XwXf...XA)=N(N—1)... Nerv 4+) Xe XS... SX, 


T(N-—1)...(N—r+1 —— Rae 
em N(N ) ( r+ is X)e(S' X)..., 


Oe! eee 
that is DS (X 2X8... Xf) =N(N-1)... (N— 14+ 1) ig ... rr, 
N(N-1)...(V-r41)_._, 
or a Va" Vp? ... 5 
ae ae 


but since these moments of the original distribution are about its mean X = 0, we 
have p, = 0, therefore if any of a, 8, ... X are unity, & S(X,*X.8 ... XA) = 0. 


Therefore, using the above, we have 


S(X,) {8S (X,)}*] 7 ( r) 
——_ = ‘eis 5 


ij = 
MM =>, A A V 





yeas [SDP 2S WS CDP LS GD) 
also M, =% — 7s — N3 + N: ; 


~ ee | l i, She (1-5 3 \ a6 
Le. oM, =(1-) (2 y) t+ l wt y:)” ; 
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Further 
vies | WS (X,7)}* 3 {S(X,*)}? {8 (X,)}? : 3 {S(X,)} (S(X,)\* _ (S(X,)}*| 
243 =_ N? N4 N 5 Né sd 
ee, ek Pee ee remem nn 
Le. »M, =v, + >-[30,0, — 60."] 4 ye Ls — 129,09, — 60,7 + 207.5] 
l 9Q- 2s = — 9 = 3 l 9Q- 92> 5 9952 »Q— 3 
~ W: (30, — 300,0, — 180,’ + 48p,"] + yi [3% — 369,0, — 229,? + 630.7 | 
Ec ps4, ss el eae 
— [%— 15p,0, — 1002 + 307,°], 
Again 
M=3 | S(X,*)" _4 US (X1°)}? (SCX)? ‘ 6 US (X2)}? (S (X1)}* 
: 4 N . N . N . 
_4 {S (X,?)} iS (Xi)} {S(X,)}* 
N’ Ns 
. (i=4 re = 47 l - = Q-2 95 52 94525 RQ— 4 
ie. oM/ =pA+ y L602 — LOv."] + V [40,0 + 302 — 420,02 — 240.2 d, + 537,"] 


] 
+ .. |b — 200,0. — 249, 0, — 1502 + 1809, 0,2 + 1929.20, — 2187."] 


— ysl ip, ~ 640, 0, — 960, 0, — 5402 + 5760.02 + 688720, — 6877.4] 
+ + |6p,— 1120, 0, — 1760.0, — 10207 + 11229,0.? + 13600," 7, — 13985."] 
- vel 47, — 920, 0. — 1607, 0. — 9502 + 11107,0.2 + 14007,20, — 15157,*| 


+ .,.[0, — 280,0. — 560, 0, — 3502 + 4200, 0.2 + 5607.20, — 6307,"]. 
iV 


Now changing the origin from the mean of the original population to the mean 
of this distribution of o°® of samples of N we have for the equations of trans- 
formation the usual formulae : 


M.=.M,' —(.M,’¥, 
,.M,=.M, —3..M,..M,' —(.M,¥, 
»M,=.M/—4.,M,..M/—6..M,.(.M)'¥ - (.M,’), 


which give 


Db 


M, = 75 [Be — 30,0, — G93 + 20,"] — 47, [3D.— 210,0;— 1852 + 269,"] 


l a ” he 3 ” = 
' ya L3% — 3300. — 220.2 + 549,°|— 5. [d, — 150,07, — 100.2 + 307.2] (11), 


“ 
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and 
» 


: ae — es we _— o Ses << = 
~M,= a [p,— v2 + 73L¥s — 40402 — 240, 0, — 1502 + 487,07 + 960,70, — 307,*] 


N 


i : 
— yal 4s — 407, 0, — 960,07, — 5402 4+ 3360,0.2 + 5287.77, — 3067,*] 
l po eee ” = cea m 
+ Ns [ 67, = YO, D2 — 1706p. Vs — 1020, oo 9240 ,02 ote 1232p.7v, oe 10447.4] 
4 
1 = oO- = “~y\- - em ¢ KA- -2 NCA - 2 9Or- 
~ a [40, — 88,7, — 1600.0, — 950, + 10500,0,7 + 13600,?0, — 13959,"] 
4 


i ie tale, Fg. . Sse : rae 
+ Vi [¥, — 280, 0, — 560, 0, — 3502 + 4207,02 + 5607.27, — 6307.4] ...(i11). 


A comparison between the formulae given above for ,M,’ and ,M, with Tchou- 
proff’s formula (25) appears to indicate some algebraical slip in his substitution of 
the value of his Weew of equat ion (21) of p.193 in the third equation of his (10) p. 187 
to get his (25) of p. 194. For his coefficients in the terms of (25) involving 1/N*® 
are exactly those of 1/N* in his Wsivy Or of the term in 1/N* in my .M,, and not 
those of this fourth moment, our ,M,, when transferred to the mean o? of samples. 

The method, due to “Student,” employed here is undoubtedly more direct and 
concise in statement than that employed by Prof. Tchouproff ; further it presents 
fewer algebraical difficulties and the algebra has the advantage of being easily 
followed and checked at each stage. 

It is difficult to estimate at present the real value of these high moment 
formulae in actual practice, but it is fairly clear that, if they be used to predict the 
results of any actual sampling, that sampling must be strictly in accordance with 
the assumptions involved and must be very carefully carried out in experimental 


work before any reasonable agreement ean be obtained. 


In conclusion I must express my thanks to Prof. K. Pearson for his continual 
kindly advice and criticism. 








THE CORPUSCLE PROBLEM. 
A MATHEMATICAL STUDY OF A BIOMETRIC PROBLEM. 
By S. D. WICKSELL, Lund. 


l. IN several organs of the human body, as well as that of many animals, 


a microscopic and sometimes also a macroscopic inspection reveals a number of 


small, more or less regularly formed corpuscles embedded in the tissue of the organ. 
Such corpuscles are for instance the Germ Centres or secondary follicles (Hellman) 
found in lymphoidal organs—as the lymphatic glands, the tonsils or the spleen,— 
the so-called Hassul corpuscles in the thymus, the Langerhans insulae in the 
pancreas, etc. Generally these bodies vary considerably in size within the same 
organ, and in different organs or different individuals the number, as well as the 
size distribution of the corpuscles will be found to be different. 

In anatomy as in other biological disciplines the last few decades have seen an 
enormous increase in the demand for exact and detailed information. The modern 
anatomist often devotes the labour and toil of many years to the study of one 
special organ, patiently collecting and measuring extensive material from animals or 
post mortem subjects. The more extensive the material the more difficult it will be, 
however, to perform for each individual all the detailed observations required ; the 
investigator will have to look for simplifications and short-cut methods of observa- 
tion, as far as this is possible without making the resulting information insufficient. 
In studying the number and size distributions of the above-mentioned corpuscles, 
the anatomists have, then, run up against a mathematical problem, which is, even 
in the simpler cases, beyond the powers of persons unacquainted with higher 
mathematics. This problem, which I call the “corpuscle problem,” is, as far as 
I know, as yet unsolved, although several anatomists have tried to reach a solution 
along more or less purely empirical lines*. At the request of Professor T. Hellman, 
of Lund, I have therefore taken the question up from the mathematico-statistical 
point of view. My interest in the matter has subsequently been greatly increased 
by the rather curious fact, that the problem is in essential features identical with 
an important problem in astronomical statistics. 


2. In order to make the matter more concrete I shall begin with a short 


deseription of the observations performed by Hellman regarding the so-called 


secondary follicles (Germ Centres) in the lymphatic tissue of the spleen. 


See for instance J. A. Hammar, “ Methode, die Menge der Rinde und des Marks des Thymus, 
sowie die Anzahl und die Grisse der Hassal’schen Kérper zahlenmiissig festzustellen”’ (Zeitschrift f. 
angewandte Anatomie und Konstitutionslehre, Bd. 1, Heft 4/5, 1914); J. A. Hammar and T. Hellman, 
‘*Kin Fall von Tyreoaplasie unter Beriicksichtigung der innersekretorischen und lymphoiden Organe : 
Abteilung II: Die !ymphoiden Organe,” von T. Hellman (Zeitschrift f. die gesammte Anatomie, Zweite 
Abteilung, Bd. vitt, Heft 4, 1922). 


—~ 


— 





— 
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The Germ Centre, or rather the secondary follicle in the Malpighian corpuscle, 
is a small, approximately spherical, body embedded in the tissue of the spleen. The 
number of such corpuscles in different spleens is very variable, ranging from a very 
few or none at all up to hundreds of thousands per organ. The sizes are also very 
variable, the same organ often containing all sizes from a small fraction of a milli- 
metre up to one millimetre in diameter. 

The observations are arranged in the following way: The weight of each spleen 
being ascertained, and fixation and preparation being properly attended to, micro- 
tome sections of 0°018 mm. in thickness are taken from different parts of the organ. 
By the aid of a projection apparatus a picture of each section is obtained in 18-fold 
magnification on a paper, and the contours (generally circular, sometimes oval) of 
the follicles are traced on the paper in lead pencil. On the drawings thus obtained 
the diameter of every corpuscle contour is measured to the nearest millimetre ; when 
the image is not circular the largest diameter and also the diameter at right angles 
to the largest are measured, the arithmetic mean of the two measures being taken 
as value of the diameter. When the diameter is below 1°5 mm. in this magnification 
the measures are rejected on account of difficulties of identification. From the 
protocols kept of these measures the distribution of the diameters of the images in 
the section is obtained for classes of one mm. breadth (1°5—2°5, 2°5—3°5, ete.). 

The outline of the preparation also being marked out in the drawing the parts 
of the paper representing the spleen parenchyma may be cut out as well as the 
images of the follicles. As a carefully selected homogeneous paper is always used 
the areas of the sections may be determined by weighing the drawings. The weights 
of the drawings before and after the images of the follicles have been cut away will 
also give sample values of the ratio of the total volume of the follicles to the total 
volume of the spleen. As wili be seen later on it is necessary to know either the 
area of the section or this ratio in order to determine the total number of follicles 
in the organ. Of course, when several sections have been taken and measured from 
different parts of the same organ, they are put together and regarded as one single 
section of correspondingly greater area. In the following we will regard this area 
as great enough for the purposes of generalization. Thus the question of the errors 
of sampling will not be taken up in this connection. 


It is furthermore clear that the diameters of the images in the drawings are 
generally not the true diameters of the follicles. Only when the section chances to 
pass through the centre of a follicle will this be the case. As a rule the centres of 
the follicles will lie more or less distant from the plane of the section. The more 
distant the centre the smaller will the image diameter be, in comparison to the 
actual diameter. As the distance of the centre is generally unknown we see that 
the diameters measured in the drawings are “apparent diameters” in the same 
sense as the projection on the heavens of the motion of a star or the distance of the 
components of a double star is called apparent, the inclination to the plane of 
projection of the true motion or line joining the components being unknown. 
Indeed, the difficulty of the anatomist at his microscope is essentially the same as 
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the difficulty of the astronomer at his telescope; he only sees the projection of 
things. The statistical problem met with is also in both cases similar. The “ true” 
character being a function of the apparent character and an unknown variable, and 
making a plausible assumption as to the distribution of this variable, the problem 
is to express the distribution of the true character in terms of the observed distri- 
bution of the apparent character. It will be seen later on that there is also one 
problem in stellar statistics, which is in essential details identical to the anatomic 
corpuscle problem, namely the problem of determining the distribution in space of 
the stars in a stellar cluster, from the projected distribution on the heavens. Hence 
some of the methods here worked out for anatomic applications may be of value 
also for the astronomers. 


Evidently the problem arising in connection with the secondary follicles of the 
spleen is the following: To find the distribution of the actual (“ true”) diameters of 
the follicles from the distribution of the diameters of the images (the “ apparent ” 
diameters) in the microtome sections. That these two distributions are generally 
different is clear from the following two reasons: (1) The apparent diameters are 
as a rule smaller than the true diameters, and (2) a random section will contain 
a relatively greater number of actually large follicles, than of small ones, because 
the former more frequently will come within reach of the section plane. The two 
causes will, however, work in opposite directions, thus having a chance to balance 
each other. It will be seen in the following that under certain conditions an exact 
balance takes place, making the relative distributions of the true and the apparent 
diameters equal. 

We must mention finally that in taking a series of successive sections instead 
of only one or a few sample sections, the true diameters may be directly observed. 
In this respect the position of the microscopist is more favourable than that of his 
colleague at the telescope. But as the measuring of one single section is generally 
very laborious, and as the material studied embraces several hundred spleens, it has 
been found impossible to use this method, except in special cases to obtain a control 
and empirical test of the theoretical reductions. These reductions naturally in- 
volve the assumption that the corpuscles are approximately spherical in form, and 
that they are fairly uniformly distributed throughout the whole organ. Series of 
successive sections actually measured show these conditions to be sufficiently ful- 
filled in the case of the secondary follicles of the spleen. Attempts have been made 
to use the serial sections for obtaining a set of purely empirical equations of 
reduction. These trials have, however, shown that the theoretical equations are 
more to be relied upon. 


I now pass on to a more general formulation and solution of the “corpuscle 
problem.” 


3. The Corpuscle Problem. In an opaque body there are suspended a large 
number of spherical corpuscles of different sizes, the density of the centres and the 
distribution of the sizes being the same in all parts of the body. This body is split 


© 


in two by a plane section. To express the distribution of the diameters of the 
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corpuscles in terms of the distribution of the diameters of the circular contours 
found in the plane section. 

Solution. Let the number of centres per unit volume be N and let F'(7r)dr be 
the relative frequency of corpuscles with the diameter in the interval r to r+dr. 
We now have 


°R 
bg (Cs GS hee ena SE Set (1). 
/~ O 


R being the upper limit to the diameters. Furthermore the number of corpuscles 
of diameter r to r+dr coming within reach of an arbitrary plane is 
Nr F(r)dr 
per unit area in the plane. Hence, r, being the mean diameter of the corpuscles: 
Nr, will be the total number of corpuscle images to the unit area in the section 
plane. Putting 
BEFORE IG) civics (2). 
it is seen that f(7)dr is the relative frequency of corpuscles cut by the plane. 
having the actual diameter between r and r+ dr. We thus have 
rF(r) = 
(y= Faia /aiere(rharsles ovis ewieieiwiaisieine sso oes eeiem (5). 


Vo 


The probability that a corpuscle with the diameter r, falling within the reach 
of the plane section, has its centre at the distance y to y+dy from this plane, is 
evidently equal to 

2dy 
at 

Hence the relative number of corpuscles of diameter r to r+dr and with their 
centres at the distance y to y +dy(y < r) from the plane, is equal to 


9 


~ f(r) dy dr. 
at 


But the “apparent” diameter of the circular image cut out of a corpuscle by 
the plane being denoted by , we have 
eo — 4°, 
and it is seen that the relative number of corpuscles having the actual diameter 
r to r+dr and the apparent diameter # to « + dw in the section plane is 
2 1 l ; 1 } 1 rdax 
ap ay . va . x a: 
fi(r)dr F hie im: h(r)dr —- = F (xr) dr a Ce ceercccces (4). 
im da er fy2— G2 No V7? — a2 
Hence, (x) being the relative frequency function of the apparent diameters in 
the plane section, we have 
. £2 Tae 
a . a » 
(x)= |e ee ES (5). 
Toda Vir — 2 
As ¢(«) is here the function supposed to be known, this is an integral equation 
to determine the sought function F (r). 
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This integral equation may easily be transformed to the same form as the well- 
known integral equation of Abel*, and thus the solution is readily obtained. It 
will be still more convenient, however, to apply the solution given by von Zeipel of 
the following equation, occurring in the reduction of the observations of globular 
clusters , 


For this equation von Zeipel has obtained the solution 


1 pew (e)de 1 per 
ieee ee “or Fall Wd Oke oo} | Sr a ape (7), 
Ir (a 
where P(«#)=- + 4 J 
a da 


The solution of (5) consequently assumes the attractive form 





o. : Opp (NW Rr? 
ns, da ree | P(VEP + r?) dl eel), 
7 Jy da Ve—72 wT Jo 
/d (a) 
1 d {| -— ) 
where Sg eg SS esse (9) 
v da 


4. This form of solution is also specially adapted to the case, in which we do 
not know an analytical expression for (a), but only have a series of numerical 
values of this function, which is what takes place both in the astronomical problem 
as well as in the anatomical problem here in question. 

The following method will certainly be sufficiently accurate for the anatomical 
applications: The observations in the sections having been arranged in classes, 
the relative frequency-histogram of the apparent diameters may be constructed. 
Smoothing this by a continuous curve (a free-hand curve will suffice) we may read 
off the ordinates (7); (¢=0, 1, 2,... R). Thence we obtain the series o(1):1; 
(2): 2; (3): 3, etc, and finally the differential quotients of (x) 


:@ may be 
obtained by the aid of the well-known central difference formula 


FS’ (a) = Ad f (@) — 4 AF f(x) + gy APS (a) + .... 


From these we easily obtain the values of P (7), and the curve y= P (w) may be 
plotted, so that we may read off the values of 


P(V? +P): (c=). 2... BR): 


* See for instance Bocher, An Introduction to the Study of Integral Equations, pp. 6—11 (Cambridge 
Tracts in Mathematics and Mathematical Physics), 1909. 

+ H. von Zeipel, ‘Catalogue de lamas globulaire Messier 3” (Annales de UV Observatoire de Paris, 
Tome xxv, 1906, p. 30); ‘Recherches sur la constitution des amas globulaires” 
Vetenskapsakademiens Handlingar, Bd. 11, No. 5, 1913, p. 9). H. von Zeipel und J. Lindgren, 
‘* Photometrische Untersuchungen der Sterngruppe Messier 37” (Kungl. Svenska Vetenskapsakademiens 
Handlingar, Bd. x1, No. 15, 1921, pp. 109—110). 


(Kungl. Svenska 
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Using the approximate quadrature formula * 


VRRP vo 
> P(WF+r)-4 P(r)] = A(r) ...(10), 


Tilo 


or (Rae ss 2; 
= : PWE+r)dl=—" 
T 





i=0 
we have F(r)=17,A (r). 
>R 
Since Ei(vjdr=1, 
“0 
' i <= - . . 
we evidently must have approximately — = % A(r). The mean diameter r, being 
“ To r=0 ; 


found in this manner, a graph of the sought function F'(r) is easily constructed. 
To determine the total number N of follicles per unit volume we may use the 
results obtained by weighing the section-drawings. These weighings give, as 
mentioned in § 2, the volume of the follicles in percentages of the spleen tissue. De- 
noting this by 100 V, and the mean volume of the follicles by v,, we evidently have 
N=V:y. 
The mean volume 2 is obtained from the equation 
ki 
T ‘ 
Uj= = = r F'(r). 
0 »=0 


The weighings of the drawings may, however, also be used to determine the 
area of the observed sections. This area being denoted by Y, and the number of 
follicles represented in the observed sections being denoted by n, we evidently have 

n=WNr,Y, 
from which equation NV is readily found. It may be presumed that this method is 
subject to a smaller mean error than the preceding method. 


5. Another solution of (5), which is perhaps more to the taste of the trained 
statistician, is obtained by the aid of the method of moments. 

The quantities v,’ and N,’, defined by the equations 

R oR 
we = [ a h(x) dx; N,, = r F'(r) dr, 
~ 0 “0 
are known as the moments about the origin of the frequency functions ¢ («) 
and F(r), As is well known from the papers ef Professor Karl Pearson, it is 
generally possible to construct the frequency function, when the numerical values 
of a sufficient number of the moments have been obtained. As a rule the first four 
moments, in exceptional cases the first six moments, are sufficient. 

Now, the moments v,' of @(#) may be numerically determined from the 
observed distribution of the apparent diameters. The mechanical quadrature 
formulae for this determination need not be entered into here, as they are familiar 
to any statistician. Hence the solution of (5) is reduced to the following problem : 
To express the moments NV, of the sought function F (7) in terms of the 
moments Vy’. 


* von Zeipel has used this formula in his cluster reductions, where the accuracy required is much 
greater than in the case of the anatomical applications. 
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Indeed, one such relation may be written out at once. As the plane section 
. T, m 4 * «) > “4 T r, 7 , 
contains Vr, corpuscle images per unit area (see § 3), it is clear that V.N, Ue 


(the mean diameter 7, being here denoted by N,’) is the area of the images per unit 


e . 7 T a > 
area in the sections. Furthermore V A N,) is the volume of the corpuscles per 


J 
unit volume of the tissue. As these two quantities are evidently to be put equal 
to each other, we have 
N, =$ N,'v,. 


A general solution is obtained as follows: We have, putting r,= N,' in (5), 


w fk dr 
QO (4) == F(r) 
4 N, }, NVir-—@ 
; 1 se a dr 
Hence Vn == amt! dz : F (r) 
Ny Jo Jz Vir—a 
Using Dirichlet’s formula* this reduces to 
; l rR : ry grt 
vy, = =, F (rv) dr | ; ye 
N, al) JoVver—v 


f rr gti 
Putting Yn = : dx, 
J0 NI" — “Xe 
it follows from well-known theorems in the theory of integration, that we have 
T . , oT ee wie y 16 7 
Y=7r, y= Mm, Yo=tr”’ s,s ULES I S Yg Heer’ S Us =: ag" 6 = a5!". 
y n= GMs WHIP; WHT; WM) W= gq Ms Yrs 
Hence we finally get 


NY’ = N,’v,' (i.e., as should be, »,’ = 1), 


ry 4. yr oe ry . a rr 16 re 2 

Nz, =-—N,v,; Ns =3No; Ni=— Miu; |... (11) 
T s oT q 

N, =32 N,'v,; N, =— Nir; ; N= 36 Ni % 


It thus only remains to determine N,’, and the problem to express the “ true 
moments ” in terms of the “apparent moments” is solved. 


In order to determine NV,’ we may for instance make use of the proposition : 
The arithmetic mean of the true diameters is equal to thre harmonic nrean of the 


apparent diameters multiplied by m/2. The proof of this proposition is evidently 


PR dx fk pa dr rR fr dx a [* _, T 
N, p (x) = da Fi(r) . a F (r) dr : =5 F(r)dr = >: 
J0 a J 0 Jz \ r? — a" J 0 JON — av =~J() = 


The moments of the sought function F'(7) thus being obtained, the construction 
of the function only depends on a suitable choice of an analytical expression. I 


* As to the applicability of Dirichlet’s formula when the integrand becomes infinite at the limits of 
integration see Bocher, loc. cit. p. 4, 
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believe that Pearson’s theory of frequency curves and the auxiliary tables given by 
him for the construction work, give ample means to meet any requirements in this 
respect. 

It may be added, that the general relations between the moments NV’ and v’ may 
be expressed as follows : 


- ee eee T ; 
N n+19 : : >» When n is odd, 
Niy,’=4 2.4.6...(n4+1) 2 
oe, 
1°R 
= 2.4.6... ’ F 
N uaa = eS , When # 1S even, 
BO. d cael 2) 


6. In § 2 it was pointed out that under certain conditions the true distribution 
F' (r)and the apparent distribution ¢ (#) become identical. In order to show this we 
put in (5) $(v) = F(~), thus getting the equation of condition 


, a [®, ay 
F (x)= F (r) — 
Tole Vr — a 
Putting F(v) : «= F(a), we get instead 
: [ fe. r dr LPN = 
F, (x)= Fi(r) 5 = F, (Va? + y) dy. 
Tola V7" — a rolo 





[t isseen at once that this equation is satisfied by the function 
- x 

F. (x)=ke 2, 

if, as in this case must be done, we put A=. Hence as we must also have 


x 


Fi(r)dr= L, 


77 


ro =o \/ 2 . 


Our conclusion is that if the true distribution has the form 


and 


F'(r) € ao 
o 
then the apparent distribution has the same form, or 


0) (2)= = e Zo, 


o~ 


That this result is true also in the inverse sense may be shown by the aid of 
the solution (8) of (5). 


It might at first seem as if the same property is at hand when we have the more 
general form 
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It is easily found, however, that in this case we get 


n r ial n ky 
, ; 4p 
F(r)=> ky, e 27; —. 
1 n 1 Tn 
7. In §3 we denoted by 
3 ; 
f(r)= ©) 


the relative frequency function of the true diameters of the corpuscles, represented 
in a random section. Introducing this function the equation (5) takes the form 


PR dr 
g(a)=a} f(r) ——. 
2 rTVvr—- sf 

In this form the equation is valid also if the corpuscles are not spherical but of 
ellipsoidal shape. The different corpuscles need not be of the same form, nor must 
the axes of the different ellipsoids be parallel, provided that as “diameter” of a 
plane section out of any corpuscle we take the geometric mean of its largest and 
smallest diameters. The apparent diameter x of a corpuscle should thus be defined 
as the geometric mean of the largest and smallest diameters in the image obtained 
in a random section, the true diameter 7 as the geometric mean of the largest and 
smallest diameters in the central plane parallel to the section plane. 

This rather remarkable fact follows from the following proposition: Take any 
three-axial ellipsoid and place it with the principal axes pointing in any direction. 
Two horizontal sections are taken in this ellipsoid, one passing through the centre, 
the other passing at the distance y from the centre. The areas of the two sections 
we denote by A, and A,. The distance of the horizontal tangent plane from the 
centre we denote by h. Then we always have 


y 
Ay= A,( -i). 


The truth of this proposition follows immediately from the property of an ellip- 
soid, that the areas of parallel sections are proportional to the areas of similarly 
situated sections in a sphere. 

Now, defining the apparent and the true diameters in the above way, we have 

mwr=4A,; wa?=4A,, 


and it is seen that we have the relation 


“=r JI - y 


y NVIP— a 
or Y= 
h 1 
da a“ du 
But x are 
lh da rv 7? — a 


is the probability that an arbitrary section in a corpuscle of central “diameter” r 
has an apparent “diameter” # to «+dz. Hence this probability is identically the 
same as in the case of a sphere. 
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8. It was mentioned above, that the “corpuscle problem” has a parallel in 
stellar statistics. This parallel is the “ globular cluster problem.” 

The Globular Cluster Problem. A globular cluster, i.e. an agglomeration of stars 
in spherieal layers of equal density around a common centre, is projected on the 
heavens (or a photographic plate). To express the “true” law of density in space 
in terms of the observed “ apparent ” law of density in the projection. 

Let f(r) dr be the relative number of stars at the distance r to r+ dr from the 
centre of the cluster, and let ¢() dw be the relative number of stars projected on 
a plane and falling within the circular ring of radius « to «+d. If A(r) is the 
density function of the cluster and 8 (x) is the density function of the projection we 
evidently have 

N. f(r) = 40’ A(r); NG (x) = 2728 (x), 
N being the total number of stars in the cluster. But it is immediately seen that 
we must have (2 being the superior limit to the radius vector) 


o/R2-x 


(a) =2 





A(Va* + y) dy, 


or by a simple transformation 


5s rk rdr 
8 (x) =2 | A (r) ——= 
J 2 \ = — i 
Hence we get 
°R ‘ dr 
o(a)=a2) f(r) ————, 
Ja PV 7? — a 


which is the same functional relation as in the corpuscle problem. The two 
problems are thus—although different in origin and formulation—mathematically 
identical. 


9. The methods of solution expanded in § 4 and 5 involve computations and 


constructions, which will certainly be easily mastered by a trained statistician. I am 
afraid, however, that the anatomists, who generally have neither the mathematical 
nor the arithmetical training necessary, will find the methods too complicated for 
application. Furthermore I suppose that the computations involved will be too 
unwieldy, when it is required to reduce the observations from numerous organs 
(the material of Hellman embraces several hundred spleens). To obtain a 
ready method of reduction I have proceeded as follows: 


more 


The observations give directly the distribution of the apparent diameters in the 
classes 1°5—2°5; 2°5—3°5; 3°5—45 mm., etc. We will denote the frequencies in 
these classes by fo, fi, fi, fs, --» fr» Although diameters in the classes O—0°5 and 
0'5—1'5 are generally not observed and hence the frequencies in these classes will 
be unknown, we may, for the sake of completeness, take account of them in the 
following developments, denoting them by f; and f,. We now have, of course, 


I = kr, o (a) da; Si =hkr, | (wv) da jeR@aaweeorennenen (12). 
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The constant k, which as a factor of proportionality will disappear in the final 
reductions to relative frequencies, here assumes the value NY, where Y is the total 
area of the observed sections, and N, as before, the number of corpuscles to the 
unit volume of tissue. 


From equation (5) we now have (7 = 1, 2, ...) 


cath ee ("Fo | dr 7 (13) 


Vi — a" 


Ji] 
or, changing the order of integration and integrating over the variable a, 
ret 


h 
dr F(r) V7? — (i — 0°5 
4 


“R — 
+k| dr F(r) (V7? — (4 — 05 F — V7 — (4 + 0°5") ...(14). 
4 


J t+ 


For Ai we get 


‘) -R 
AR=k| drF(r).r+k dr F(r) (r — V1? — 0°25)........0066 (15). 
/0 Ji 

As only sporadic corpuscles are found with a diameter above 15 mm. we may 
put R= 15'5. Furthermore, we put 


Py=k| drF(r); Pa=| dr F(r), 


and it will be seen from the systems (14, 15), that if we put 


15 15-7 
Mie & © ds, he S Pa. Asie ilesnetecs (16) 
j=l : j=0 
the coefficients a; will be approximately given by 
Ag = 025; ly =) — NP O25; dyp=N 2%—(21—O°5\: 
ay=NV(0 + 4) -(1-—-QO°5/7 - V(i +7) i | errr (16% ). 


Now, the system (16) is an approximative system of equations. The nature of 
the approximations involved will be clear if it is observed that the system has the 
following “exact” meaning: If determined from the system (16) with the coefticients 
(16"*) the quantities P,; P,; P,;... P,; are proportional to the numbers of cor- 
puscles having their diameters exactly equal to 4; 1; 2; 3; ... 15 mm. respectively, 
which, if cut by a random plane section, will give the observed distribution 
ish : ds; ent ia of apparent diameters. Gecemetrical considerations indeed show 
that the coefficients a;; are numerically equal to the combined thickness of the two 
layers parallel to the section plane, in which the centres of corpuscles of diameter j 
must lie, in order that the apparent diameter be between 7 — 0°5 and 7 + 0°5. 


The use of the system (18) will of course be the following: Inserting the 
observed values of f; the corresponding values of P; are obtained. When, as in the 
material of Hellman is the case, we do not know f; and f,, it is seen that we can 
still use all the equations (18) except the first two. Having obtained the values of 
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P,, P;,... Py, it will, however, be possible to find the value of P; and P, by a 
graphical extrapolation—or rather interpolation, as the curve may be assumed to 
pass through the origin. Finally, dividing the quantities P by their sum S we have 
) , 
py =%) as 
eat Magia i 
where F), F,, F,, F,, ... F\; denote the relative frequencies of the true diameters in 
the classes O—0°5, 0°5—1°5, 1°5—2°5, 2:5—3°5, ... 14°5—15°5. 
In the next section we shall give some numerical examples of the use of 
equations (18). 
10. Numerical ecamples. Of the following numerical examples the first three 
are arranged in order to obtain a practical test of the mechanical solution (18). 
In these examples the exact answer is known beforehand, and the examples have been 
so chosen, that they illustrate the applicability of the method in some typical cases. 
Example 1. As a first example I take the following distribution : 
i _ 
$(#)=7,,€ 26 
(5) 


We here evidently have in pro mille 


: ri xr x _ (0 )- 
I, = L000 -@e 0 da= 1000 | l—e 50 \, 
— /0 25 / 
" ritk @ x , (i-}) _ (é+4)? 
and Fi = 1000 xe Hdr=1000(e #0 —e. » ) LA ee (19). 
J i-} =) \ / 


As was shown in § 6 the true distribution is here 


F'(r)= a e 25 
» 
and consequently, if the system (18) is applied, the F;(= P;:%P;) found should be 
approximately equal to the corresponding f;. 
From (19) we get, writing the numerical values below the symbols, 
hi fy / fy fs, he f. f, So So In Sa Sis Iu Ih =f 


& av fs 100 156 Zt Tit 105 89 TE 54 39 27 18 11 FT 8 1600 


The residual number (= 8) is of course the value of the integral 


“x 


: i) (aw) da. 


1 
It may here be left entirely out of consideration. 

Inserting the above values of the f, in the system (18) we get 
P\ P, P, P P, P; Ps P, P P, Pro P,, Ps P. Py, P 
08 69 11°99 163 19°0 19°7 190 169 143 113 86 62 42 2:9 1:7 1°8 
of which the sum is 161°5. Expressing this in pro mille we get 
Fi F, F. F. F, F F, F- Fy Fy Fi Fu Fy F 3 Fy, F's =F 
5 43 74 101 118 122 118 105 89 70 53 38 26 18 11 11 100042 
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It is seen that these values of F; are, indeed, very nearly equal to the f;, which 
shows that in the case treated the system (18) has worked very well. That there 
should be slight deviations at the extremes was, of course, to be expected. 


Finally, it follows from § 6 that here we have 


It follows, however, from § 9 that we also have 
1 &P; 
— ==! = 01608, 
To D> 
which gives in this case 
r, = 6°22. 


tzample 2. It is evident that the system (18) can be used also when we have 
fewer that 15 classes. We then must of course put the superfluous f; equal to zero. 
It is also clear, however, that the fewer the classes the less exact will the mechanical 
solution be. In order to test the degree of approximation in a rather extreme case 
of 6 classes only, we shall apply the method to the distribution 
4 __@ 
i) (x)= (1-27 ye? 2 (1°27)? , 
which gives in pro mille 
fh oh fe fs fy fe =f 
73 428 355 121 20 2 1000-1. 
“quations (18) now give 
P, P, Po Ps Py Ps =P 
57 309 218 64 10 1:0 659 
or in pro mille 
=F 


‘ F; 
87 468 331 98 15 1:5 1000°5. 


As an exact solution would, here also, give the apparent distribution back again, 
we find that 6 classes are generally too few if a higher degree of approximation is 
required. For the anatomic praxis heie in view the accuracy required is, however, 
not so great but that the application of (18) to series of 6 classes will give fairly 
satisfactory results. 


The exact value of 7, is here 


’ 


/ 
— 71 ~~ 
ry = 1:27 a/ 5 = 157. 


The approximate solution gives 


ry = 4000 = 1°52, 


65 
Example 3. The accuracy of the solution (18) depends not only on the number 
of classes into which the observations have been divided, but will be affected also by 
extreme skewness or abnormal excess (i.e. abnormal clustering in the central classes) 


Biometrika xv11 7 








98 The Corpuscle Problem 


in the observed distribution. As an example of a distribution of this kind we may 
take the following one, derived from the frequency function 


a 


a 2 
h (x) = 0°65 e 20127" + 0°35 e 2 (435)2, 


aw x 
(1:27) (4°35)? 

This function has actually been found by me to express the apparent distri- 
bution of the stars within the stellar cluster Messier 3. 

The function gives 

Ai fi to ts ti I. te Rok Le te St te In fz 

50 295 264 124 59 49 43 36 28 20 138 8 5 3 1000-3 
and we find from (18) 
P P, P, A ew BTR RD A RA Om KR RS 
399 2041 1468 529 141 95 79 67 53 38 24 13 09 O8 496-4 
or in pro mille 


P, Py FP, PF; Fy Fy Fe F Fs Fo Fo Fu Fo Fis =F 
80 409 296 107 28 19 16 14 Ill 8 5 3 2 2 1000 
1000 
T S$ = = yx ) 7 
hus To 496"4 01 
The exact distribution is here (§ 6) 
" 2O¢ ps AYA ~4 r » > a ) QF r coat a ; . 
F;= 1692 | om (0 65 (1-27) 2 (127% + 0°35 (4°35)? 2(4 3) dr, 
which gives 
T 
— — 9:19 
r, = 1692 me = 2°12, 
and 
Py P, PF, Fs I, F; F; F;, Fs Fy Fo Fu Fe Fig 
64 377 320 123 35 20 164 #144 #10 7 5 8 2 «21 


It is seen that the result of equations (18) is here somewhat inexact for the 
classes where the apparent diameters are most clustered. 


This approximation 
would, however, be quite sufficient for anatomical purposes. 


Example 4, As a fourth example we take the followimg distribution, observed 
by Hellman in 8 sections from a human spleen. (The diameters are 
as measured with 18 x 18 magnification.) 


given 1n mm., 


Ss=146 | fy=197 | f,=210 | fp=184 | fF=143 | f=95 | fy 
: | - d 


10°5—11°5 | 11°5—12°5 | 12-5—13°5 | 13-5—14°5 | 14%5—1°55 Sum 


fu=15 Si2=7 Jis=4 Su=2 Sis=1 1092 
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The frequencies have been slightly smoothed graphically. For the classes 0—0°5 
and 0°5—1°5 no observations have been made; for the class 1°5—2°5 the frequency 
Jo= 52 was found, but this value is evidently too small (it would give a negative 
value of P,), probably owing to the fact that the observations are not complete 
down to 1°5 mm. 


Using, consequently, only f, tof; we find, inserting in (18), 
P3 P; Ps Ps P; Px Py Py Py = Pip Pi3 Py Pig =P 
154 3271 415 367 304 204 121 66 31 13 O6 O04 O03 2009 
Plotting these values graphically, we may venture an interpolation of the values 
g ea: y I 
of P;, P, and P,, in continuing the curve down to the origin. We have thus 
o's 2 5 : 5D 
found P,;=1, P,=3, and P,=8, The sum now becomes 212°9 and we get pro 
mille 
Fy Fy F, F3 Fy Fs Fs FF; Fs Fy Fy Fyn Fy Fis Fu Fis 
5 14 ‘388 72 Ul 04 re 148 96 ST SLUG 6€ 3S F I 
This gives 


15 
r= siya (2 7 F, + 0°25 F,) = 5°82 
1 


for the mean value of the true diameter. 


7—2 
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INTRODUCTION. 


In August 1902 Sir Francis Galton published a memoir entitled “The Most 
Suitable Proportion between the Values of First and Second Prizes.” He concluded 
that if the amount of the prizes had to be decided before the competition, it seemed 
reasonable to divide the prize money in the ratio of the mean excess in merit 
(supposing it measurable) of candidates who are first over those who are second to 


the mean excess of those who are second over those who are third. 
This led Professor Pearson to consider the following problem : 


“A random sample of » individuals is taken from a population of N members, 
which when N is very large may be taken to obey some law of frequency which is 
expressed by the curve y= NV¢(«), yde being the total frequency of individuals 
with characters or organs lying between # and «+ 6a. It is required to find 
an expression for the average difference in character between the pth and 
(p+1)th individuals when the sample is arranged in order of magnitude of the 
character.” 


This general problem was termed by him “ Francis Galton’s Individual Differ- 
ence Problem in Statistics.” If Xn,p be the average difference it is required to find, 
the solution of the problem is given by 

' Pe 


Xan Go fel —ayr de, 


(n— p)! p! 


where 
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This result was obtained independently by Professor Pearson and Dr Sheppard 
and some numerical results are given in the original paper*, on the assumption 
that the original population is normally distributed when (i) becomes 


a.” r n—p p 
Xn, p (n —p)! p!. Bs, qd a) dx, 
a ¥ 
where a= [ CED: .cckacs sock Dictaea ee eee earn (11), 
wo / 2a 


o being the standard deviation of the original population, and particular values of 
a being obtainable from the tables of the probability integral. 
Two methods were given for the computation : 
(a) Quadrature. 


(8) An expansion of the integral in a series about the mode of the integrand 
by a method due to Laplace. 


La i . . . . 
rhe problem was regarded as a new and important departure in statistical 
theory, in fact Professor Pearson concluded his investigation in these words: “It 
(the method) opens up, indeed, many new methods of enquiry, the effectiveness of 
which however can only be tested by their application in actual statistical practice t.” 
In order to consider the likelihood of the method being useful in statistics, it is 
necessary to go rather further into the theory than has hitherto been attempted. 
It seems desirable to find not only the mean difference between the pth and 
(p+1)th individuals, but also the standard deviation of that difference, and even 
the actual frequency distribution of differences, since if it should turn out to be 
Biometrika, Vol. 1. pp. 385-399. 

+ It was suggested by Professor Pearson that the above result might be used to obtain fraternal 
correlation coefficients in the following manner : 

Pairs of individuals are drawn at random from the general population and their mean difference for 
the character in question is measured. Hence from (i) we may find 

X2,1=e, 
o being the standard deviation of the original population, \ a known number calculated from (i) and 
Xe, the mean of the differences between the pairs drawn. If now pairs of individuals be taken not quite 
at random, but from correlated groups, e.g. pairs of brothers taken at random, the result will be \ times 
the standard deviation of the correlated groups, e.g. a group of brethren, or we have 
X'o.1=AC J1= 72, 

where r is the correlation of the group, e.g. the correlation between brothers. 

Whence JT-pa*™!, 

X2,1 

Now if the phrase ‘‘pairs of brothers taken at random” means ‘‘brothers” in the ordinary sense, 
e.g. pairs of individuals who are actually brothers of one another, I do not think this expression for r is 
correct. It would however be correct on the assumption of linear regression and homoscedasticity, if 
by the term “ brothers” we meant individuals each of whom had a brother with the same value of the 
character, i.e. if we took pairs of individuals at random, each pair from the same array of the correlation 
table. 

But this is of no practical importance as if we had enough material to form a correlation table we 
should use the product-moment method, certainly no one would under these circumstances average the 
differences between pairs taken from the same array. 

At the end of this paper, p. 126, we give the correct expression for 7 when we average differences 
between pairs of ‘‘ brothers,’’ in the ordinary sense of the word. 
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markedly different from the Gaussian that fact would have to be taken into account 
in any applications of the method. It is the object of this paper to determine that 
frequency distribution approximately and if possible exactly, on the hypothesis that 
the original population is normally distributed. 

(1) Mean Difference between the pth and qth Individuals in Order of Magnitude 
of a given Character. 

Although not directly concerned with the main point of this investigation, it 
may be of interest to have a direct formula for the mean difference between the 
pth and qth individuals in random samples of n, arranged in order of magnitude of 
the character. 


Ro 


L_ oh 


ey ae 
0 





Assuming a frequency distribution y = V@(«) in our original population of V 
(where N is very large), let 2, #, be the magnitude of the characters of the pth and 
qth individuals ; then 

a= i " (x) dx =chance of an individual having a value of the 
re character < wp, 
and 1 — a=chance of an individual with a greater value of the character. 

Therefore chance of the pth individual having his value of the character 
between w, and «, + da, 


Mb: 


= n—p — n\pn1 - i 
(n—p)! (p—1)!* (1 — a) $ (@,) day, 


n! ‘ie 
and mean a, = a”? (1 — a)? o (a,) a, da, 
md mean 2, anmienis).. ( )? @ (ap) My 
n! he da 
= - a"? (1 — a)?" a, dx, 
aopt@rD: = ( "P da, 


vp 


oD oO ] 
a: n: f_ a (| (1— a)P—) qn—P basal dey Lp day 
(n—p)!(p—1)!( Jo diy Lp dx, 


0 d rae, ie : 
— 7)P—!1 pip " ek 
* [. > dix, (| _ (l—a)P"a das, det | Ly dis P| 


8 


s n! i = fx (1 @)?-1 qn—P da de ie) x 
~ (n —p)'(p— 1)! | 1 ) dx, dia ieal ? 


) 

- l / 

+ | [ 1 ( 1 - a)? QP - dity (=) | 
J =F ie - 


L he) la 
+ | dx, | (l—a)? tar? . diy 
{ Up 


) a; di, 


0 


—-@ 


ro Lp da 


= dkp 
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All the integrals in this expression are convergent (provided the frequency 
distribution is of finite range or behaves at + 0 like e-**) and the terms in square 
brackets vanish at both limits. 


Similarly we have: 


! Pe as ] 
mean Ly = : I dix, (1 —a)t™ Q"-4 = du, 
(n— gq)! (q- 1)! \Jo J %q da, 
r0) rr, ] 
= di, . ( i — a)i- qd ; a dix,) : 


Therefore mean («, — «9)* 


























if a-arer de [a —aerr eae del 
=], Fe —p+l1,p) “Bie gti. oy 

F [ - (1 —a)? a"? 2 da (1 — a)? a4 =“ da:| 
~)_ ere ~ “eerie | de 

1 } [* (-ayerranv dx § (1 — a)? @"-4 = :) | 
-),U-“ae=petp ~~ serge) 
a B(n—p +1, p) cf B(n—q+1, q) | dae 

: { [ “(1 = a) a2 da ¥ (i-eP" da| 

a ee +L) - Se STTS | da 
= l” [Len —Qt+1,q)—La(n— pth, p)} da .......cccceccecscececeees (iii), 


* To prove mean (x, — #,)=mean x, -- mean zg we may proceed as follows : 
Let 6 be the difference between the values of the character in the pth and qth individuals in any 
particular sample of n individuals. Then 
0=Ly —WUq- 
This is true for every sample, therefore summing for all possible samples and dividing by their number 
0= Lp . qe 
But to some minds it may seem better to approach the problem of this section by the same method as 
that used for consecutive individuals in the 1902 paper. If we do so we are easily led to the expression 
n! | \* m4 (a ~ a! 1(1—q)P- (xy) dx, d 
a4 (a-—-a')tPm! (1 — a)P-! (x, — tq) (Xp) > (aq) dx, dx, 
! p q p q yas 
(n—q)!(q-—p-1)!(p-1)! J _.J -« 
for the mean difference, where 
ae ‘x 
a= f (x) dx, a= | a (x) dx. 
J -& D 


By expanding (a — a’)?-?-! and integrating by parts this may be shown to be equal to 

(1 ) 

; (-1)8 a” sora at—P-s-1 (1 — a)? da | 

s=q=p-1 > | 
- - | dx 

s=0 (q-p-s-—1)!s!(n--q+s+1]) a 





’ 


n! . 
(n-q)!(p-J)!J _. 


f 2 
where i p(x) de. 
—@ 
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x 
where J, (7, s) denotes the ratio of the incomplete Beta-function | (1—2) a dx 
0 
. . 1 
to the complete Beta-function [ (1—a2) a5 da. 
“0 


The numerical computation of this expression would involve the calculation of 
pairs of incomplete Beta-functions of a at convenient intervals of # and then per- 
forming a quadrature upon them. It would therefore be very troublesome, and it 
is questionable whether it would be any improvement on the process of finding the 
mean differences between the pth, (p +1)th, (p + 2)th, ... gth individuals by for- 
mula (i) and summing the results. 


(2) Moments of the Frequency Distribution of Differences between Consecutive 
Individuals, 


We will now find expressions for the moments about any origin and thence 














| 
-p-1 
f2-4 
Xpn+im 
fr) =p J 


about the mean of the Frequency Distribution of Differences between consecutive 
individuals in order of magnitude. A knowledge of the first four moments about 
the mean will enable us to find an approximate curve which very closely represents 
the distribution. 


Let pw,’ be the rth moment about the origin, and yw, the rth moment about the 
mean of the frequency distribution of differences between the pth and (p+ 1)th 
individuals in random samples of n from the original population [y = V¢ («)]. 


[Ze Lp 
Let a denote ‘ (x) dx and a’ denote | P+ b (a) dx. 





If we now expand the incomplete Beta-function and perform the summation for every term, the 
expression may be reduced to 
! 


~~ 4 n! n! 
| ) (1- a)?! qr-aH 4 
x 


n!(l—a)Pa 
J um (m-9at))!(¢-1)! (n-—q +2)!(q-2)! 


n ?) 

= q-2 gn-qt2 1. i cn tsatipadiaael E , 
(1-a)?-"a «Tia -pil ¢ da 
=mean difference between (q - 1)th and qth individuals 
+mean difference between (q - 2)th and (q —1)th individuals 
+mean difference between (q —3)rd and (q — 2)th individuals 


+mean difference between (p—1)th and pth individuals 


a result which is easily seen to be equivalent to (iii) on expanding the two incomplete Beta-functions in 
that expression. 
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Then 


' , 
, n! [* io Sli - da da 
aa aE " ; a?-P- (1 — a)? (ay, — Xp41)" =— diary datys. 
Be (n—p—1)!(p—1)!J -2J -« (ty — Xp) dt,da,,, ° 7" 
n! | ” da Lp , da 
= ———_______ (1 — a)?" — da a’®-P-1 (7, — xy.) dx,,,,\. 
' (n—p—1)!(p—-1)!J -« +. - "Us : oe Aip+1 i 
| 
‘ (tp, . da! 
Now a ea (Xp — Up+ i)’ lax : dry +1 
= Lp+1 


ei  |*P ftp g'™-P — 
- - —p \*e— ten) . +7 = <p — %n) At y+. 
~ & 7 -@ 


The term in square brackets vanishes at both limits. 
‘ma? 


oc ren ‘ _ nea " 
"Go ..* a) Tha 


where Q =| © @t—P (2p — Lar) dap sr. 
. ! = Pp a) po = Pp 
- ih f _(-a) Q 4 | (1—a)? dQ 
(n—p)!(p—1)!\ - ine § 
The term in square brackets again vanishes at both limits and 


dQ) [* 


oo —p . . an ‘—2 /J',, 
oa’? (r= 1) (ay — apps? dtp: 


Therefore Py 


Therefore p, 


dips 3 





dix, ~=-@ 
: , “(r—l).nitf(? f% ; 
Therefore = | (1 — a)? a'™-? (ay — #4)" daydxy, 
by, (n—p)! p! ) 0) -« (x, p+) paky+ 


r(r—1).n!(? /# 4 | 
= — Pp n—p ae \r—2 as 
(n—p)! p! | fa a)’a ( y) d dy, 


rx ty 
where =  (u) du, a’ = | II. watccacuisstintce Kole (iv). 
Putting r = 2, 3, 4 in succession we have: 

, 2.n! 2 1 led 

= ies P o/n—-p uv 
Ps me ae a)? a a 

6.n! ae 

= -_—__ 1—a)?a”?(«#— “LO 

* mor ie ee, ere ee 


12.n! fe re 
’ n | (1 —— a)? a n—p (a = yy dady 


= G@—pyipll - 





and of course py,’ = . [" a”? (1—a)? dx 
(n — p)! p!J -« J 
and by the ordinary formulae for transferring moments to the mean 
Pig = fly = phy? | 
fs = fy — Spy fe + 2p," Pot Taeeanee bese wns eee (vi). 


Ma= Ms = dept’ py’ e pte pr” = 3p, * 
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We have now to consider the best way of computing the expressions (v) 
numerically for particular values of r and p on the assumption that the original 
population is normally distributed. 

We have to calculate integrals of the type 

<2 x 
| [ Z(a—yy dady, 
—-Ds —@ 
where as many values of Z as desired can be calculated numerically, and r is either 
0, 1 or 2. 


By turning the axes through 45° this can be transformed into 


om reo 
| Z'dx'dy’, 
0 / -@ 
where Z=Z(u-yy. 

The integral is now in a shape in which it is easy to apply a cubature* formula 
and the process will be aided by the fact that (#— y) is constant when a’ is constant. 
We may note here that throughout all the numerical work in this paper, the 
standard deviation of the original population is taken as a unit. 

The cubature formula we use is the following ¢ : 

- [* hk 


Zdudy = —-— [— 19 1u_s + 1688u_, — 7843 
_ Sdudy = 39950 L- 191u-s + 1688u_, — 7843u_, + 60480u, 





0 l= 
4+ 128803, + 119272u, + 121151u, 
+ 120960 (uy + tts + Ug + ..+)] ccereceereeecees sceadtVEL) 
where u,=sum of all the Z’s at intervals k for which «= rh. 


correct up to and including seventh order differences. 


This formula is 


The process, in this case, amounts to calculating ordinates at convenient equal 
distances, summing them in diagonals and multiplying by the appropriate co- 
efficients. For example, in the case n = 10, p=1, values of (1 — a) a” are calculated 
at intervals of quarter units of standard deviation from #=— 1°75 to 5°00 and 
y=—1:00 to « +°75 (for smaller values of # and y the ordinates vanish to seven 
places of decimals). 

If we require the second moment of the frequency distribution of differences 
between the first and second individuals in samples of 10, we have only to sum the 
successive diagonals beginning from the left to obtain u_;, w_,, wu, ete. 

If we require the third moment the w’s are obtained by multiplying the sums 
of the successive diagonals by — 3, — 2, —1, 0, 1, 2, 3, ete., and dividing each by 4, 
while for the fourth moment the appropriate multipliers are 5, 4, 0, +5, is, 7, 
etc. The results obtained by the cubature have then to be multiplied by the 
appropriate factorial terms given in (v) (for the case of n = 10, p =1, these will be 
20, 60, 120) ¢. 

Cubature is the name sometimes given to the process of calculation of volumes which is analogous 
to quadrature for areas. 

+ A proof of this formula will be found in 7'racts for Computers, No. x. p. 27. 

+ The numerical work for this case is given fully in Tracts for Computers, No. x. p. 30. 
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In this manner we have calculated the first four moments of the frequency 
distribution of differences for the values n = 8, 10, 50, 100, 500, 1000 and p=1, 
p=2 on the assumption that the character is normally distributed in the original 
population. 


We find: 





n 3 10 50 100 500 1000 

H1 =Xn,p *8458 ‘5388 *3942 "3594 “3044 *2873 
2) 1°1730 5054 2830 2378 1735 "1554 
i 2°1158 "6424 *2825 2201 "1401 “1196 
pa’ 4°5575 1°0004 "3520 *2558 *1432 1171 

p=2 

n 3 10 50 100 500 1000 

py’ Xn, p *8458 *3453 "2263 2018 "1655 *1550 
2 1°1730 °2155 ‘0961 ‘0772 "0526 0463 
ps) 2°1158 "1858 ‘0581 0423 ‘0241 “0201 
py £°5575 *1990 0437 70292 | ‘0141 0112 








while the moments about the mean and £,, 8, are as follows: 


n 3 10 5O 100 500 1000 
po=o7" "4577 2151 1276 "1086 ‘0808 ‘0729 
Bs "3495 *1383 ‘0704 "0565 ‘0381 0331 
By *8988 *2434 “0980 0736 0433 0363 
3 1°28 1°92 2°39 2°49 2°75 2°83 
Bs 4°29 5°26 6°03 6°24 6°63 6°83 

p=2 
n 3 10 50 100 500 1000 
po=0," "4577 0963 0449 0365 0252 0223 
M3 3495 0449 ‘O161 0120 ‘0071 0060 
py “8988 "0540 0127 0090 0044 0037 
B, 1°28 2°26 2°86 2°96 3°15 3°25 
Bo 4°29 5°82 6°30 6°76 6°93 7:44 | 
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It may also be of interest to give the standard deviations and coefficients of 


cht Ga\. 
variation (100 =) in the twelve cases. 




















1 
p=! 
bee ia = | 
n 3 10 | 50 100 500 1000 | 
| | 
Ca "6765 “4638 *3572 *3295 "2843 *2699 
L00c4/pn;' | 79°98 86°08 90°61 91°68 93°40 93°94 
p=2 
n | 3 10 50 100 500 1000 
| | 7 ee See | | 
ee | 
| Oa *6765 "3103 °2119 “1910 *1588 "1494 
LOO g/ j4;' 79°98 89°86 93°64 94°65 95°95 96°39 
It will be seen that both for p=1 and p = 2, although the variability decreases 


fairly rapidly, the coefficient of variation increases steadily and seems to tend to 
100 °/, as n tends to infinity. 

The moments above calculated will be used to obtain curves which approxi- 
mately represent the frequency distribution of differences. 

But before obtaining approximations it will be well to consider more closely 
their exact nature. It is the purpose of the next section to determine the actual 
as distinguished from the approximate frequency distributions and where possible 
to submit them to numerical calculation. Where the latter is not possible our 
investigation may nevertheless assist us in determining what type of curve may 
reasonably be used as an approximation. 

(3) Actual Frequency Distribution of Differences between Consecutive Individuals. 

The chance of the pth and (p +1)th individuals in descending order of magni- 











tude (in a sample of n) having a difference between ¢ and t+ dt is, on the assump- 
tion that the original population is normally distributed, 


n! os 1 . 
‘n—p—1 ] — p-1 - e~(e—t)? y— ha la. lt 
neeenes Ter ae ® vas" ) (ae - 
n! "L=« en _ 1 —(a—1t)2+-142 . 
“expen...” i a ‘Dn? ee aa 
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w-t ] ss rr 1 ; , 
where a = ei" du, a= | —— FO lege 5. c0c0es (viii). 
-% Qa J—@ \ 2Qer 
Putting «—4t=~' we obtain for the above expression 
-- — = a ™-P—1(1 — a)P b 3 e~@" et" da’. dt (ix) 
(n—p—1)!(p-1)!Jv=-« * Qar itachi. —_ 
. w-}t J F re'tht . 
Now a’ =| — edu, a = | e~* du. 
-2 NV Qar =@ Qe 
da! ae Mee 1 da’ 
Therefore ~e= = —— Ce? = = da! . 
ot 2  Qer Ps Ou 
Oa 1 1 pwsins_ 1 


Now if we expand a, a in powers of ¢ we obtain : 


(Pa\ P Ova\ t” 
=e + (58), + (3a), a+ + Ay 17 


of: 
eats (7) vee (oe m8 . é ‘ (55) t” . 
P at ), ae) ait ot ar )rit 
where the suffix 0 denotes that ¢ is to be put zero after the differentiation 
5. a 1 is 
But a = edu, 
-@ NV Qer 
(=?) Sor § Uw 2 
—) =-~- —— ?du= e~% 
Ot/, 20H J —« Von 2/2 
O"a (1\" dt ( = 
(3e),~ (3) am Tie ): 
“" . Oa : 1\" d'- oa § —ly’2 
Similarly (FF -) = (- 3) a5 i(: Pat har : 
Therefore 
; [‘ 1 wg 1 2) ¢ 5 1d ( | oe? (y+ 
ie BE ag dut (5 C8 13° Uda! \Won ) pias 
lo ee 3g 
+ > dal fo e ) (5) Sb ca eeeuaesecanewen (x), 
, I" l = l | “) + l d ( 1 ene” (5) 
a= edu — -@7# - 4 —... 
ly dr 1 oe ey a 
“— (zeae 5 Aisne. -seectneee (x bis). 


If now we substitute the expressions (x) and (x bis) for a and a’ in (ix), re- 
expand and integrate term by term, we obtain the frequency distribution of differ- 
ences in the form 

e~* (Series in Ascending Powers of ¢). 


* We take as our unit of measurement the standard deviation of the original population 
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A mathematical expression for the general term may be found by the use of 
Burmann’s Theorem* for expanding one function in powers of another. 
Burmann’s Theorem may be written 


S (2) =f (a) + S 8 Or 
r=] 


r! 


d=! (((¢—a@)\" w,\} 
where 3, = ; 
vher B, l= F(z) / PX ae 


and a is a solution of the equation F(z) = 0. 








Let t= F(a’) = (1-2), 
i.e. the functions F(a’) and (1 — a) are the inverse functions of 
av—tt | me a) 1 
d= | -e "du and l—-a= — gt? dy t-. 
Jao VQ Jaf 44¢ NV Qar 
Then a is given by F(a’) = (a) =0, 
Ca 
or = —— et" du. 
J-@  Qar 
; = RY 
Accordingly qr tie qt? '+ % or 
r=) * 
when Pee (* = “y ; am (xi) 
yhere = —— 4 n—p- “Pt | amg, rc eereerecece X1). 
da a ( t : ) i.e. t=0 
ee = Ce 
Similarly (l—a)??=(1-—a)?1+ & ' 
y=1 1: 
ad § a-—a\" oo 
where C,.=- ( ) a DRO a — weeeneees xi bis). 
me Oe BaF) Dao |p, on Gl 
, r,, 2 Ut 
Therefore a—P-1 (1 —a)P = U,+ & oe 
yo 1: 
2 : ‘ r(r—1). , ‘ ss 
where U, = B,C, + rB,C,.4 + -->5, 8 AS ee 2) 5) See (xii). 
Therefore the frequency distribution is 
! 1 i TP iets 
y= ~! ae em : ie evapbaseaees (xii) 
(n—p-—1)!(p—-1)!V2r e i 
0 Y ue 
where V,. =| U,e-*" dx 
— \/ Qar 


and U,, B,, C, have the above values; also 


( fa’ 1 )2-p-1 
B=a-?= | —— ei" dy 
{ -® Qa i 
x p-1 
C,=(1-a)P= if a er ldu| 
U,= B,C. 


* See Edwards’ Differential Calculus, 1912 edition, p. 455 and Whittaker and Watson’s Modern 
Analysis, p. 129. 


¢ These inverse functions certainly exist, since for a given 2’, there is only one value of a corre- 
sponding to one of t and vice versa. 
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We thus have a formal solution to the problem; but even if it could be shown 
theoretically that the series was always convergent, it would still have to be 
admitted that the form is not easily adaptable to numerical calculation. 

The calculation of the B’s and C’s in the first place involves the evaluation of 
a number of undetermined forms, which becomes somewhat troublesome when 7 is 
greater than 3. In practice, it is easier to obtain as many terms as desired in the 
expansions of a"-?— and (1 — a)?" direct from the expansions for a and 1 — a, by 
Arbogast’s rule or some other convenient process*. We have, however, succeeded 
in working out completely the cases of n = 2 and n =3. 

(a) n=2, p=1. In this case B,=1, C=1, U,=1, B,=U,=C,=0 for all 
positive integral values of r, 


aa. | ” 
Ve = | e~*"* da’ = 
and the distribution is 


y= /2-— oe = (xiv)}. 


The range is from 0 to «, since the first individual has, by hypothesis, a 
greater value of the character than the second, and so the differences will always 
be of the same sign which we take to be positive. 

Thus we have the interesting result that the frequency distribution of differ- 

5 | 
ences between pairs of individuals taken at random from a normal population, is 
itself a normal curve whose standard deviation is V2 times the standard deviation 
of the original population. 
x’ 1 
(8) n=3,p=1. Here B=a =| edu, C,=1, 


-o NV 2er 


gp — a’, (1 _ q)P- = 3 


ay . q l aN 
Cherefore U,.= B,= -(-—1)y - ( e-#"), O,=0, 
NV 


Qar / 


for all positive integral values of r. 
* Arbogast’s rule in its most convenient form is as follows: if 


x x’ n 
Ayr x tea t a3 35 + see 


x? i x8 
oxzt+A,—-+.. 
“2! "3B 


be expanded in the form 
Ayp+4,x+A 


the coefficients A,, A,, A;, ... can be successively determined from 
i. oe a 0 \ 
Apis =| G +45 ra +... 
Cay Oa, ” 0A, / 
See Edwards’ Differential Calculus, p. 450. 
t+ The moments of this distribution are easily found and are as follows: 


2 : 8 : ‘ 2 4 /4 _ *« & 
y= oak Mg’ =2, py = f=» Ms =12, pw.=2(1- ) » K=F ( = 1) > =A (3 --- :) 5 
V1 V1 \ v Na \™ i i 


This value of ,' is in agreement numerically with that found by Pearson in the 1902 paper. 
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ee 1 


Whence V-= j B,e~** da’ 
J -» NV Qe 
6 oa SF 4 
and y = Fae niiicciistemnvel (xv). 
: N Qar o 7r! 


The numerical calculation was carried out as follows: 


See 1 ae 
ir y t= — se" 
Writing Z = A 
ve hav ws , B,2da' (xvi) 
we have — = POM eionciace he neser ee seeuainonoaee XVl), 
NV Qar J —@ 
l ; , *—1)(r-2 as 
B,= 5, (- 1)" |‘ 4)? (2a/y + = a (— 3)? (22’y* 
ae. 1 —_ y P aa a i ae 
4 (7 dQ A 3) (r — 4) (— AY (2a! Pe oe cee (xvii) 


From (xvii) the explicit values of the B’s were obtained from B, to B,, and then 


y 


jz: was calculated by a quadrature, taking ordinates for # at every half unit of 
V2 


standard deviation. The work was simplified by the fact that when r is even B, 
contains only odd powers of # and consequently V,=0, that is to say only odd 
powers of ¢ occur in the series in (xv). It would take up too much space to give here 


: ae : ; a) ee 
all the numerical work in full, but as a specimen the calculation of Fa is given. 
27 
We have B,= — xy 2 (a4 — 6a’? + 3), 
V. _ 
and 7a -| A 2? (a's — 6a? + 3) da’. 
9 v2 
NV 27 L 


z can be obtained from tables of ordinates of the normal curve, whence we have the 
following values of a’ and 2 (a*— 6a + 3): 


, 


x Z3 (x"* — 6x’* +3) 
+ 45 ‘0000000 
+ 40 ‘0000000 
+ 35 ‘0000000 
+ 3:0 0000030 
+ 2°5 ‘0000246 
+ 2°0 — ‘0007870 
+15 — ‘0118135 
+10 — ‘0283346 
+ 5 0681852 

0 "1904811 


Using the Euler-Maclaurin formula which, in this case, reduces to half the sum 
of the ordinates, we have: 


| 35 2° (a4 — 64°? + 3) da’ = 1215182 
V; iin 
and = — 0037974, 
N 2ar 
V. 


°— = — ‘0000316. 
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In a similar way the other coefficients of the series in (xv) can be calculated 

and we find for the frequency distribution 
y =e*" [84628 — -27566t + 007664 — 000194 
-+ 0000036¢? — 00000002629 + ...] ......(xvill). 

This is the actual frequency distribution of the differences between the first and 
second individuals in samples of three, taken at random from a normal population ; 
the unit in terms of which ¢ is measured being the standard deviation of the 
original population. 

By repeated integration of this series term by term, we may obtain series for 
the moments which are however less rapidly convergent than (xvii). For this 
purpose it was necessary to calculate all the odd V’s up to Vj, so that the first four 
moments might be obtained and compared with the results of Section (2), p. 105. 

If we write (xviii) 

y =e" [ay t+ Gt + a+ al +...] 
the series for the rth moment is given by 


by = [b, + b, + bs + bs +... + Dog, + wale 


L 


where Deyn = Cages | OO ko kwiwdocaacneceeoece (xix) 


~v0 
We have the following numerical values : 
Pe) 


x x x ss Dn 
n ay meat ja, | tent at | Qy-2 tet at | ay | met at|a | etl ar 

0 0 0 Jo | 0 

| 
| 
0 1°49999 
1 - 55132 1°69257 
2 O - 97720 3°00000 
06128 0) 2°20531 6°7 7027 

4 O 16286 0 — 5°86322 17°*99999 
‘ — ‘01216 0 *49006 O 17 °64250 
6G 0 — *04033 0 162864 0 
, “00276 oO — °14569 0 5°88073 
8 0 O1072 0 — 54658 0 
9 — ‘00032 0 04424 0 — 2°33312 
10 8) ‘OO138 0 °19296 0 
11 “00014 8) — ‘00633 0 *93348 
12 O “00068 0 “OBO39 0 
13 — ‘00003 O — ‘00339 0 - *15203 
1) 0 ~ ‘OOO15 | 9) ‘O1761 O 
15 “00002 0 ~— OOO81 0 — ‘09487 
1G 0 “00009 0 - ‘00452 0 
17 ~— ‘00001 8) | “00054 0 — ‘02594 
18 0 ‘00318 0 
19 — ‘00020 8) 01937 
20) — ‘00128 0 
21 ~ 00819 


whence we have: 
area = 1:0001, py = 2°1315, 
py = “8465, py = 45769 
fg = 11731, 


Biometrika xvi1 S 
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By the method of Section (2) we had: 


area = 1°:0000, jtg = 2°1158, 
fy = 8458, pe = 45575. 


pe = 1°1730, 
The agreement seems satisfactory, as the divergence in w; and yp,’ is due to 
stopping at the coefficient of #” and neglecting higher powers in our original 
series (xviii). The moments about the mean by the two methods are given below : 


be B3 My 
Method of Section 2 4565 3656 "8629 
Actual Series 4577 "3495 ‘S988 


ia hl . . . —_ . , , 
The divergences in pw; and p, occur for the same reason as in pw, and py. 


We have further calculated from (xviii) the ordinates of the curve at intervals 
of 0°5, so that it may be plotted. 


t y t y 
00 84628 3°0 01958 
05 66642 3°5 ‘00613 
10 "4.5022 4°0 ‘00168 
15 26055 45 00044 
20 *12898 50 00011 
2°5 05458 5°5 00003 

6:0 ‘00001 


(y) n=10,p=1. An attempt, extending over some weeks, was made to com- 
pute the actual frequency distribution for n= 10, p=1. 


In this case it may be seen from (ix) and (xiii) that the frequency distribution is: 


( = , 
= “ 18> Vet 
N Qar 0 7 
2 Y 
where V,= Se ged en eee eee Ree (xx), 
J -2 NV 2a 


By: - en, , ; , 
and — is the coefficient of ¢” in the expansion of a’* in ascending powers of ¢, a 
r ' 


being given by (x bis). The first ten terms in this expansion were worked out by 
Arbogast’s theorem and the first eight V’s were actually computed; but the 
resulting series was not sufficiently convergent to make the numerical computation 
of the frequency distribution possible from a knowledge of these eight terms. The 
attempt was then abandoned owing to the amount of arithmetic involved in the 
computation of the higher V’s becoming prohibitive. To show that this is so we 
need only mention that B, is given by 


B, = 3 [504028 + 141120az*z, + 2520a? (562°z, + 2102*z,?) 
+ 4200a? (14242, + 1122* 2,2, + 84272,°) 
+ 210a‘ (5622, + 4202?2, 2, + 280222. + 84022,2z, + 105z,") 


+ 42a° (28272, + 16822,2, + 28022.2, + 2802, 2.2 + 210z,2z,) 


“1 <2 


+ Ta® (822, + 282,25 + 56222, + 35z,2) + a7 27)... ececeecececsceeerees (xx1), 
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a’ 1 oad l se 
where . =| edu, z= e732" 
~« NV Qar Qa 
d” 1 “ce 
and , = = ( en} ) 
da” \/ Ir 


Sufficient numerical values of B, have to be calculated in order to make it 
possible to obtain V, by a quadrature ; further the number of terms in B, increases 
with r. 

It would thus seem that for the higher values of n we are thrown back on the 
method of fitting curves to the moments calculated in Section (2) ~ order to 
obtain information about the frequency distributions. 


This information may however be supplemented in one respect, by the com- 


putation of the values of y in the actual curves at the start of their range, ix 
when ¢t = 0, 


We then have 


' 


v= (u—p- I) (p- mi a"P-\(1 — a) da’, 


a’=-@ 
fe 4 ’ 1 ee 
where a= 7 edu, z= Se oc oscseseconsevens (xx1l). 
J -o N27 \ 2Qer 


For p=1, » = 10, 50, 100, 500, 1000 respectively, this reduces to 
90 ae2da', 


x 





a®2?da’, 


-x 


2450 


? 


9900 | a22da’ 
249,500 | a 2? da’, 


999,000 | a 22da’, 


and for p = 2, n = 10, 50, 100, 500, 1000 to 


720 a(1—a)z*da’, 


Pies 


117,600 a?(1 — a) da’, 


—-@ 


970,200 | a” (1—a) 2*dz’, 
12,425,100 a*"(1 — a) 22da’, 
997,002,000 | a™*(1 —a) 2¢da’. 


These integrals may be calculated by means of a quadrature. This has been 
done, taking ordinates at intervals of *2, it being necessary for n= 100, 500 and 


8—2 
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1000 to take “a” to ten places and use ten figure logarithms. We have the 
following results : 
Values of Ordinate when t= 0. 


n=10 n=50 n=100 | n=500 n=1000 
p= 1 1°539 2°249 2°495 3°037 3°241 
p=2 2°537 4°104 4°656 5°769 6°193 


(4) Approximations to the Frequency Distribution of Differences between Con- 
secutive Individuals. 

We will now turn to the problem of finding approximations to the frequency 
distributions by fitting curves to the moments calculated in Section (2). Much 
difficulty has been experienced in finding a curve of a comparatively simple type 
which is at the same time an adequate representation of the distribution for all 
the cases that have been considered. There are of course only two cases in which 
the adequacy of the approximation may be directly tested, namely, n = 2 and 
n = 3, for these are the only two cases in which the ordinates may be calculated. 
In other cases we must be content to compare the first four moments about the 
mean and the initial ordinates of the actual and approximate curves. This is, 
however, adequate, as if these five quantities are in good agreement, the fit may be 
regarded as good. 

Three types of curve suggest themselves as possibilities for approximation. 

(a) A Pearson frequency-curve. 

(8) A curve of the form e~” (polynomial in 2), i.e. the sum of a finite number 
of tetrachoric functions. 

(y) The tail of a normal curve, Le. a curve of the form 

ai {(x + h)? h?) 
yoy e *\ > sg 
(a) Turning to the values of 8,, B, 


given on page 107, we see that they lead 
to the following types : 








n=3 ) 
n= 10 
n= 50 Type Z,, 
p=] 
n= 100 f a\™ a\™s oe 
+ Le. Y= % ~ yhere the origin is : 
n = 500 ey=y (1 +2) (J =) , Where the origin is at 
n = 1000 the mode and a,, a, m,, mM. are positive. 
p=2 {n=3 
| n=10 
| n = 50 Type J,, 
n= 100 


ie. y=y(1+~)'/(2—1)" where the origin i 
I= 2 - &L. Y= Yo\ = where the origin 1s 
I n = 500 . y \ a, \ Cs , si 


n=1000 } at e=0 and a, a,, m,, m, are positive, 
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Now these curves start in the one case with a zero ordinate, and in the other 
case with an infinite ordinate and so are not adapted to the representation of the 
actual distributions which we know start with a finite ordinate. If their constants 
be worked out from the moments, the curves may be shown to fit well except near 
the beginning of the range, but their failure there is a grave disadvantage. 


It may be noticed if we examine the §,, 8, plane that in all the cases con- 
sidered the point §,, 8, lies near that portion of the biquadratic curve along which 


A , ‘ - Ch a — 

the appropriate type is Pearson’s Type IX, i.e. y=y(1 - ) . This is a J curve 
a 

starting with a finite ordinate. It has accordingly been tried for the case of n = 3, 

a and m being determined from the known values of y, and y,’, but the fit while 

naturally good at the beginning of the range is on the whole worse than before. 


This method has therefore to be abandoned. 


(8) The next method which suggests itself is to take a curve of the form 
e* (a + ba + ca* + da) or of the form e~ "(a + ba + ca* + dx* + ea* + fx*) and to 
determine the constants from the known values of the initial ordinate, the area and 
in one case the first two and in the other case the first four moments about 2 = 0. 


Now for n=3 both these forms lead to an excellent fit but when the method 
is tried for higher values of n it gives rise to negative frequencies for large values 
of «; showing that 3 or 5 terms are not enough adequately to represent what is 
really an infinite series, even when the coefficients of the terms retained are 
modified so as to make the moments agree with their actual values. So that this 
method too is in general inapplicable. 

(y) The fitting of a curve of the form 

‘a f(w+ h)? - h) 
y=ye "| 2 J 


is perhaps suggested by the fact that when x=2 the fgequency distribution of 
differences is accurately of this type and further that it would seem to be possible 
to expand y in the form e~!* (Series in powers of #), a form which has actually 
been found to represent the distribution. 


This curve is fitted by making its mean and standard deviation identical with 
the actual mean and standard deviation, which leads to equations from which y,, 
h and = may be determined. A comparison between the actual values of y, »; and 
u, and those given by the approximate curve is then a test of the adequacy of the 
fit in those cases when the ordinates of the actual distribution cannot be directly 
calculated. 

It has been shown by Pearson and Lee* that if h’= Be and if m,(h’), or for 
Ps 


; ; : ; ie 
brevity m,’, represents the incomplete normal moment function | e732 a da, 


Jh’ V Qar 


* See Biometrika, Vol. vi. p. 63. 
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: y|) 1 zs ; ; 
and that if y = =| 75 e-th ), so that M is the area of the complete normal curve 
2 \V Qer 


of which our curve is a portion, then 
» to , , fo 
oa\" Be — fy* = MyM —™M,? 
, = Why = to —_ ' , 4\9? 
My ja" (m — h'm,? 
M=1/m,*, 
D = py Me [my — h’tneg’) = pay Wg. .e0e-cevecees (xxiil). 
yw, and yw, have been tabled against h’. Hence we may find y, from the actual 


mean and standard deviation and given W, we may find /’ from the table. Then 
from h’ yy, is found and accordingly : 


p> - ia Wo, 
M = 
and finally Po ae Bg! esccosccccvers nnn re ne .+-(XXiV). 


To test the adequacy of the fit ~, and mw, can then be calculated from the 
relations 


f , , , , e\ 
ag (Ms oh Mm 9 {m \* 
eS oe ) ; 
Mo Mo My Mo 
m,, Ms m, me (m,\? mi’ ‘) 
4 4 3 1 ’ 2 x . 1 \ 
y= >! ,-4—.— + 6—. ‘) —3| ‘) meres ossatkkv 
Mo My Mo My \M \My if 
and together with y, compared with their actual values. 


3 {tae ) 
If y=ye “\ 2 


be fitted in this manner by using the means and standard deviations given on 
p. 107, we find the following values for y,, h and >: 








p= p=2 
n h = Yo n | h } = Yo 
| 

4 0 1°414 "564 2 _ _— — 
3 525 1°271 *848 8 *525 1°27] *848 
10 1°295 1°095 1°529 10 1°609 *878 2°540 
50 | 2°133 1-060 2°250 50 2°295 "785 4°105 
100 2°384 1-046 2°5O08 100 2°587 774 4°655 
500 2°896 1-027 3°037 500 3°071 ‘749 5761 
1000 3°116 1°025 3°237 1000 3°170 ‘733 6°148 
1200 3°189 *729 6°289 


These values if substituted in (xxvi) give us the approximate frequency 
distributions of differences between consecutive individuals. 

To test the adequacy of the approximations we have the following table 
comparing the actual values of y, #; and w, with the values obtained from (xxiv) 
and (xxv). 

Here we are supposing the area of the tail to be unity so that the total frequency of our approximate 


distribution will be unity as it was taken to be in the actual distribution of Section (3). If MZ’ be the 
area of the tail we have M=J/'/m,’. 
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p=l 





n=3 n=10 n=50 n=100 n=500 n= 1000 


| | 
Actual) Fitted | Actual Fitted | Actual | Fitted Actual | Fitted Actual Fitted Actual’ Fitted 


yo 848 | 848 | 17539 1°529 2-249 2-250 2-495 2°508 3°037 | 3°037 3241 3°237 
pz; | 350 | 352 138. «136 «070 069 057, 057 038 «-036) +033 “034 
py 899 | 906 243 239; 098 104 074 «070 0438 —*| 036, — 














n=3 n=10 n=50 n=100 n=500 n= 1000 


Actual Fitted Actual) Fitted Actual Fitted Actual Fitted Actual) Fitted Actual | Fitted 


Yo *848 °848 | 2°537 2°540 4°104 4°105 4°656 4°660 5°769 | 5°761 6°193 | 6°148 
Bs "350 "B52 "045 "045 ‘016 ‘O15 ‘O12 ‘008 ‘007 ‘007 ‘006 ‘006 
My 899 = -906 7054 055 °013 ‘017, ‘009 — "004 — 7004; — 


This shows a satisfactory agreement. 

In the case of n=3,p=1 or 2 we may proceed further and compare the 
ordinates of the actual frequency distribution (xviii) with those of its approxi- 
mation 

at 5 
y = "848 e >| (1° 
We find: 


v y 
Actual Fitted 
, 
0 "846 "848 
"5 ‘666 ‘667 
1°0 *450 "449 
1°5 “261 *260 
2°0 "129 128 
2°5 "055 ‘054 
3°0 | ‘020 ‘020 
3°5 006 006 | 
4-0 “002 “002 | 
1°5 000 000 


which also shows a satisfactory agreement. 


* The values of «, in the approximate curves (when n=500 and 1000, p=1 and n=100, 500 and 1000, 
p=2) cannot be obtained because three decimal places in h are inadequate for this purpose and for such 
large values of h we cannot table y, to more than three figures even using Sheppard’s Tables of the 
Probability Integral to twelve figures. Therefore h itself cannot be obtained to more figures. 
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The approximate distributions are shown in Diagrams 1 and 2, while in 
Diagrams 3, 4, 5 h, & and y are plotted against n. The points lie on smooth 
curves and thus it is possible to obtain from these graphs the approximate frequency 
distributions of differences between consecutive individuals for other values of n 
when p is either 1 or 2; that is the approximate frequency distribution of 
differences between the first and second or second and third individuals in samples 
of any size. 

In Diagram 1 the unit of the horizontal scale is the standard deviation of the 
original population, the total frequency is taken as unity, and where the reading 
of the vertical scale is y the corresponding frequency is yds. The standard 
deviations in Diagram 3 and the h’s in Diagrams 4 and 5 are also measured in 
terms of the standard deviation of the original population as a unit. 


CONCLUSION. 


(1) We have shown that the frequency distributions of differences between 
the first and second and between the second and third individuals in order of 
magnitude, in samples of any size taken from a normal population, are closely 
represented by J curves whose equations may be written in the form 


a f(x + h)* - he) 
y=ye “| 2 ff, 


We will now consider briefly what is the bearing of this on Galton’s prize 
problem. Galton said that the proper ratio in which to divide the prize was the 
ratio of the mean excess in merit of the first individual over the second to that of 
the second over the third. 

But it may be urged that we should take not the means but the most probable 
values of these quantities and find their ratio. This would lead us to a difficulty, 
for each of these is zero*, or we have to evaluate in some way an undetermined 
form, and it is not clear how this should be done. For our own part, we should 
be disposed to keep to the means though fully realising that this is somewhat 
arbitrary fT. 

(2) We will now consider how to obtain the correlation between brothers by 
averaging 

(a) The differences, all taken positively, of pairs of individuals drawn at 
random from the population. 


(8) The differences, all taken positively, of pairs of brothers drawn at 
random. 


We shall assume that the original population is normally distributed and that 
the bivariate correlation surface for pairs of brothers is also normal. 


The idea that the most probable value should occur when the two individuals are exactly alike 
ceases to be strange when it is realised that the probability of any particular value is infinitesimal. 
+ [The writer we think overlooks what Galton was aiming at; he was not seeking the most probable 
individual result, but how to be just ox the average to competitors, Ep.] 
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APPROXIMATE FREQUENCY DisTRIBUTIONS 
oF DIFFERENCES BETWEEN SECOND & THIRD INDIVIDUALS 
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CONSTANTS OF APPROXIMATE DISTRIBUTIONS. k AND Yo. (p=1) 
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Let x2,, be the mean of the differences, all taken positively, between pairs of 
individuals drawn at random from the original population, y’.,, the mean of the 
differences, all taken positively, between pairs of brothers drawn at random. 


Then from the previous theory 
y 


a ae cece eee r ee eseeeesersceees occ AREVET 
7 


o being the standard deviation of the original population and we have to find the 
mean value of x’, ;. 


Let the population of pairs of brothers be represented by the normal frequency 





surface 
1 ,_ 1 (#2 -2ray+y? 
z= e 71-r o ) 
Qro2V1 — 7 
and let us determine the mean value of | #— y}|. 
A-Y X+YV 
Let g= =, Y= — 
v2 V2 
“s “2+ y @£—%4 
then A= ; y Y= ; Y 
V2 V2 
and a—y|=|V2Y|=| Y’ |say. 
The surface may then be written 
x y2 
1 ~4————. [X?+ ¥2-r (X2- Y?) 1 od — 
e tae! : rs Jo e 4 ance “(l-r) o?, 
Qaro? V1 — 7" Qaro? V1 — 2 


The frequency distribution for Y may be obtained by integrating for X 
between — o and « namely: 
1 oe 
—— ~€ (¢N1-r)?? 
ao V2r (1 —71) 


whence the frequency distribution for Y’ is 
y” 
] -} aaa 
a= — = @ "teN2Q—n)p co ....(XXVI11), 
V2mro V2(1—1r) 


9 

9 Te. 
1 . ’ r “o/s ee 
[Therefore y’.,= mean) } or oV2(1—1r) =~ ocV1l—rT......(xxix)4 
X 2,1 a ( ) or ( ) ’ 


and a puipiece\annnederad (xxx). 
X21 
* Cf. footnote, p. 101. 
+ This formula is a particular case for ¢;=c, of one first obtained by Pearson in the form 


g ee 
“a>y lo, ¥ - af? ae 
N 43 nr? 


Drapers’ Company Research Memoirs, No. tv. It can be used to determine r if o), c2 are known, or are 
obtained from the observations. The method here proposed differs from Pearson’s, by the fact that we 
eliminate the unknown standard deviation of the original population by taking the ratio of the mean of 
our two sets of differences. 
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It is worth while finding the probable error of r calculated from this formula, 
We shall obtain a first approximation only, assuming statistical differentials may 
be treated as mathematical differentials and that y‘s,1,y2,, are each obtained from 
averaging N pairs. Then 





where d,,, denotes the difference, taken positively, between a pair drawn at 
random ; d', , the difference, taken positively, between a pair of brothers. 
Taking logarithmic differentials of (xxx) 
1 or 8y'o1 8xX2,1 


rl ~ 72 2 , 
4(1—r) ~ 3 X21 X2,1X 21 
Summing for all possible samples and dividing by their number 


1 (dr)? ta (8%) + (8y2,1)" = 28y2,1 5x 2,1 


1 o;* a oy’. +. oy, - 2 [8x01 8x21] 


4 (1 -ry a x’, | X"2,1 X2,1 X21 , 
where the symbol [ ] denotes mean value. 
° ° od, i 2 = = 
But Ox = = V {2 (1 - =\\ D> sscwucasenwesseer (xxxii)* 


and [Sx 18x21] = 0, 
because we are dealing with two independent processes of sampling. 


It remains to find oy’, ,. 





i o's, ee 
Ox = TNT IY 
— Me 
1(, (7 Ye teN20-n}dV" 
—N 2 | / ~X 31 
NVJo  V2reV2 (1-1) j 
-_ sa 
“—9e's +i een —r) 
3-2 ee 9 
= c-2 i! -3\ sos deaiisert haar sd Saitte (xxxiii). 
2! _2 2 
Therefore 7X 201 3 (7 2) o"xX2,1 
X 2,1 N X"2,1 
and ~, {ext 4 Srant — Se), 
(1-1) Xe Xa N 
7—2 - @! sag r) ° 
=2(1-r a cas soniyonncscil xxiv 
or oy (1 »/ WV 7 TN (xxxiv), 


* Of. footnote, p. 111. 
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which gives the following ratios to the probable errors as they would be obtained by 
the product-moment method with the same value of V. 


r Ratio r Ratio 
“: 1943 6 1336 
2 1781 | 1:257 
3 1°644 ‘8 1:187 
“4 1526 9 1°125 
"5 1°425 1-0 1:068 


Thus the probable error is always larger than that given by the product-moment 
method, but if a large enough number of pairs be taken there seems no reason 
why the method should not be practicable. 


(3) In order that we might be able to obtain from the calculated r the 
probability of a deviation in excess or defect, we should really require to know not 
only the standard error but also the frequency distribution of 7 calculated in this 
way. This seems a very difficult problem. For large values of NV it would pre- 
sumably be approximately normal, but for smaller values it would seem to depend 
on the frequency distribution of differences. A knowledge of the frequency distri- 
bution of differences, therefore, may well be a first step in its solution. 

At present our knowledge of the frequency distribution of differences is 
complete only for samples of two or three individuals, though good approximations 
have been obtained for larger samples. This together with consideration of 
probable errors suggests that it is for large numbers of small samples that Galton’s 
Difference Problem is most likely to lead to practical results. 

But it must be acknowledged, in conclusion, that the complicated theory of 
Galton’s Difference Method makes it evident that very considerable difficulties will 
attend its application in practice, if an intelligible and well defined meaning is to 
be attached to the results obtained. 


I am indebted to Professor Pearson for suggesting the subject of this investiga- 
tion and for some valuable suggestions and criticism, and to Miss Ida M*Learn for 
Ss 


the preparation of the diagrams. 








ON THE MULTIPLE CORRELATION OF BROTHERS, BEING 
A NOTE ON MR J. O. IRWIN’S MEMOIR, AND ON MY 
STATEMENT OF THE APPLICATION OF GALTON’S 
DIFFERENCE PROBLEM TO THE DETERMINATION OF 
THE DEGREE OF RELATIONSHIP OF BROTHERS, MADE 
IN AUGUST 1902. (Biometrika, Vol. 1. p. 399.) 


By KARL PEARSON, F.R.S. 


Mr Irwin's paper does not seem to me to put the case for the application of 
Galton’s Difference Problem to the subject of brothers adequately. I had not at 
the date his paper went to press examined my statement of 1902, and took on faith 
that what he wrote in his paper (ftn. p. 101) was in accordance with my remark of 
more than twenty years ago. Returning to that paper I find no ground for his 
interpretation of my statement. My words are the following : 

Anyhow I think we may say that if the individuals [i.e. a pacr] be taken at random from 
a population, then the probable value* of the standard deviation of that population is nearly 3? 
of the difference between the two individuals. Thus by averaging the differences between pairs 
of individuals taken at random we can obtain fairly readily an appreciation of the standard devi- 
ation, i.e. the variability of the general population. Further, if we take individuals not quite at 
random, but from correlated groups, e.g. pairs of brothers selected at random, the 33th of the 
average difference of the pairs will be the standard deviation of the correlated groups, e.g. a 
group of brethren ; hence the degree of relationship between such correlated individuals may be 
determined. This is only a suggestion of one of the many possible uses of Galton’s difference 
problem. (Biometrika, Vol. 1. p. 399.) 

The paragraph may be somewhat obscurely worded, but I think the suggestion 
is definite in the fact that the pair has to be taken from a group of brethren, i.c. a 
group all of whom are offspring of the same parental pair. Mr Irwin supposes me 
to say that the two individuals are to be taken from a group all of whom have a 
brother of a given intensity of character, i.e. from the array of a fraternal corre- 
and thus interpreted there 
appears to me to be no error in it. A group of brethren differs from the general 
population in having all its ancestry right back the same. If o be the standard 
deviation of the general population, then the standard deviation of such a group 


lation table. My statement says a “group of brethren” 


of brethren is ¢ V1 — R?, where R is the multiple correlation coefficient of the son 
on all his ancestors. If we pick out pairs of individuals at random from a group of 
brothers, and from the general population, then the ratio of the mean differences of 
such respective pairs is such that: 
Mean difference of fraternal pairs 
Mean difference of general population pairs — 
[* The use of the term ‘‘probable value” in the light of our present knowledge is incorrect. The 
statement as to averaging differences is correct. But the mean or average value is not the probable value, 
for the distribution is known to be skew, as is well illustrated in Mr Irwin’s paper.] 
Biometrika xv 
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It is thus possible to find R by this process, and R& involves at once a determi- 
nation of r the so-called “ correlation of brothers” as found by a fraternal corre- 
lation table. 


The difficulty of the process in the case of hwman families is their small size; 
this renders it impossible to draw pairs from an adequately large group of 
brothers, who would have ¢V1—R? for their standard deviation. But this difficulty 
(which does not exist in the case of some other species) can be got over as follows: 


Suppose we take samples of size n from a population of standard deviation > 
following the normal law of distribution. Then the distribution curve of the 
standard deviation s in the samples is known to be 


ns* 


S - 
or » 2>2 
z= const. X sa : 


and the chance of a standard deviation falling between s and s+ ds is pro- 


portional to 


But this must also be the chance of the square of the standard deviation s? or 

avi P rs . —- 8 4 ] i a »\2 : iN a : E 
fe having a value between s* and (s + &s)?, or between by and fo + Ofte. Expressing 
the result in terms of fz, We have the chance proportional to 

n—-3 Ni, 
(3) 2 , 2>2 dp, 
i 


S2 


or the frequency curve for the distribution of jy, is 

n—-3 ny 
20 (P2\ 2 2 233 
9 | So 


ty= x ( 


The mean value of », in samples, or “@,, is accordingly 





a n—3 Ny 
) 2 Mey 23 dy 
S2/ Ne Se 
« 0 a _ 
— 
a = a n—3 ny 
(H2\ 2 2>2 OH 
Ss ,) ¢ sy 
0 — — 
“D 
, n+ 1 
3 gain—l) ¢ dz mm 
2S2] 282 \ ) 
a— J i) a ad \ a 
= —_— —_ = ie By if 
1 it Shes 
| 2t—9 ened r( 
2 
70 -_ 
n—1\, 


nt ie 
Hence the mean standard deviation squared of pairs of brothers is obtained by 


taking n= 2, or is equal to 4>?= $o° (1 ~— R?), 


~ 
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Now let a, and #, be two individuals, then their mean = } (a, + 2.) and their 
standard deviation squared equals 
$ {(@, — 3 (4 + 2)) + (2 — § (a, + 2))?} 
sa 4 (x — v2)’. 
Accordingly we want the mean value of } (#, — «,)* for all pairs, say NV, and this 
equals 
} 8 (—#-(@,-2)}* 
N 
=1(¢7 +07 - 2roa,c,), 
where r is the so-called correlation of brothers and o, is the standard deviation of 
the first, o, of the second brother. Taking these equal, we have both equal o; and the 
mean standard deviation squared of pairs of brothers = }o°7(1 —r) = $07 (1 — R?). 
Hence r= R?, or the correlation of brothers is equal to the squared multiple 
correlation coefficient of the offspring on all ancestry. A knowledge of R gives r; 
and Mr Irwin translates my 1 — R? into 1 — r* and supposes I meant pairs from an 
array and not pairs from a group of brothers as I wrote. 
We may now examine the matter from another aspect. Consider the variate a, 
in correlation with the variates #,, #,...,, the correlation coefficients being given 
by 7u» between the uth and vth variates. Then it is well known that the multiple 


correlation coefficient of #, on a, #2, ... @, 1s given by R,, where 


A being the determinant 


Mio l Poy Yin | 
iss On l a 
Yno, Tris Tres ] 


and A, the minor of the leading constituent. If 4 takes the form 


l, 7, 2 7 
r, 2, py. 
Fi Py Esco 
/ P> Pp l 


it is easy to show that 


Ry, = l+(n—l)p’ 


Accordingly if p=r and n be made indefinitely great, we have 
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The form thus taken by F# is, however, the form suitable to an indefinitely great 
group of brothers. We learn accordingly that: 

(a) The multiple correlation coefficient of the offspring on an indefinitely 
extended ancestry is equal to the multiple correlation coefficient of one brother on 
an indefinitely great number of brothers. 

(6) Either of these multiple correlation coefficients is the square root of the 
ordinary correlation coefficient for two brothers. 

(c) The simple correlation coefficient for two brothers is numerically of about 
the order 0°5; it follows that the multiple fraternal correlation is of an order 
somewhat over 0°7. It is this multiple fraternal correlation coefficient which is 
directly determined, when we take the ratio of the mean difference between pairs 
of brothers to the mean difference between pairs of individuals from the general 
population. 


Further, the importance of the multiple fraternal correlation (2), if determined, 
is very great, for on the assumption of the Law of Ancestral Heredity—i.e. the 
decadence of individual ancestral correlations in geometric progression—it enables 
us to write down with comparative ease the expression for the mean value of the 
offspring with a given ancestry. The fundamental formulae for the Law of Ancestral 
Heredity are given in Biometrika, Vol. vill. pp. 239—240, and may be briefly 
recapitulated here. «, is the character value in the individual offspring in the last 
generation. X, is the character of the “ mid-parent” in the pth ancestral genera- 
tion, # and Z are the corresponding means and o,, =, the corresponding standard 
deviations; o, the standard deviation of individuals in the pth ancestral generation; 
Pp—q the average correlation of individual ancestors in the pth generation with their 
offspring in the qth generation (p >q); Ep the coefficient of assortative mating in 
the parents of the pth ancestral generation, p,_, the average coefficient of correlation 
between individual ancestors of the pth and individual offspring of the qth 
ancestral generation, and r,_, the correlation between the mid-parent of the pth 
and the mid-parent of the qth generation. Then the multiple regression formula of 
the offspring on the series of ancestral mid-parents being 


ae oy 7 % ) 
Ly — C,=~S Ir = (Ap Xpt 


with a variability 
2o= con V1 ae S (CyT po), 
and the geometrical decrease of p,_, being assumed, it is shown that 


o 


p 


2 V1 +e, 
(v2) 


—p 


/D\p—¢ ! 2 > 
pq = (V2)P-9 py_y|V(1 + €p) (1 + €)- 
Accordingly if assortative mating be constant throughout the generations, 
p y TS > » trie |: raf r= The aki 
Tp—q follows the geometric law if p,_, does. Then takin 


oe 
1g 


pq = aR, cy =ryn?, 
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we find B-—n=Yn, 
(1 — 2a6? + 8?) 
a : 1=0 
+9 Bd —e) a , 


which give y and y in terms of a and 8. Further, 


or Ra v8 _,. 


the fraternal correlation as we have shown on p. 131 above. 
Accordingly a knowledge of R*? and ithe equations for 7 and y above enable us 
to find B, y and 7 in terms of a or a8. But for parental correlation we have 
aB=r,= V2 p,/(1 +e), 
or a8 is known in terms of the mean (p,) of the two parental correlations and the 
coefficient ¢ of assortative mating. 
We will write p,/(1 +e) = py. 


Substituting for yn in the value for R®, we have 


whence we find 
_ R-af? — R-BV2p/ 
B(R?-a) BR-—V2p, 


Substituting this value of » in the quadratic for 7 above, we find 


| 


0 


2p? — R21 — Re) _ 
R? : 


B? —28 V2 p,’ + — 


— (1 — R*) (R? — 2p,”) 
. —_ 9 : ao ee ee 
01 B=V2p, ta/ PR : 


We shall consider the alternate roots later. 


Then a=V2 pi /B, 


(BYR 
and Pp—q = Pi (43) 


gives the geometric law of degradation of the average individual ancestral corre- 
lations. Finally 
(8? —1) R?-—Bv2p, 
and 9= 
BR? — V2 p, 
enable us to determine 
> a= ama! 
C= yn. 
and complete the solution for the multiple regression equation of offspring on 


ancestry. 
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Thus, a knowledge of: 
(a) Parental correlat.ons ; 
(b) Fraternal correlations ; 
(c) Coefficient of assortative mating, and the assumptions 
(i) that the population is stable 
o, = >, for all values of p, 


Ep = € ” ‘ 


(ii) that the individual ancestral correlations follow a geometric law of degra- 
dation 

enable us to determine the multiple regression equation, without determining 
the individual ancestral correlations. There is, however, an exception which is 
so important that it needs special consideration. 


Our equations become indeter- 
minate when 8 =1. We will deal with this case. 


Case of equality of all coefficients of correlation. When 8 = 1, r,_,=a, whatever 
the values of p and q. All the mid-parents have the same correlations with each 
other and with any generation of offspring. We therefore need the multiple regres- 
sion equation, when the fundamental determinant is 


A= 1, 7, 7,...7 | ton+1 rows and columns. 
a r | 
rs & ? 
r, *, £, l 


= : l n—I1r : 
Add up the last n rows, divide their sums by = and subtract from the 


, 
first row. We have 
ie! Bees oe, 
Ll+n—l1r 
—7 . be, 7 | 
) my 1 
) Yr, 1 a 
nr \ (1 =r)(1 +7”) 
, =(1— , Ba = a 
” ° ( [+ (n—1)r) 1+(n—1)r 


a (l-7r)(14+(m—1)r) 
Similarly Avo = 
imilarly cca. a 
where Aw, is the minor obtained by cancelling the first two rows and first two 
columns. 


Aon ’ 


Now let us endeavour to find 


Au = / V5 7 7 
r,-l, x, ) 
Yr #, Lb 7 
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and subtract from 


= 
Add up the last » — 1 rows, divide their sums by 1+(n—2) 
h = 


the first row. We have 





(n—1)7r? | 
Ay =-| r- —— » Rs OE ce D 
l+(n—2)r , | 
7 y Bs 8 ? 
” 55 a AL ? 
? ee ee 
(rn —1)2° 
=—(1 "i 1+ (n—2) .) Avon 
r(1—r) 
=— Agia. 
1+(n-—3)r ™ 
Substituting the value already found for Aj, we have 
An _ r _ Ap 
Aw ‘o 1+(n—1)r ; Aw 


by the symmetry of the determinant. 

Accordingly the multiple regression equation for the mean value of an array 
of «,’s, when determined from a series of variates #,, @, ... @, of standard deviations 
0, G,...6,, When all the correlations are the same, is 
y r Ly — Lp 


y 
Ly — Ly = ToS 
1 1+(n-—1)r om 
nr 


is the multiple 
1+(n—l)r I 


where {,=0,V1 — R? gives the variability, and R? = 
correlation coefficient *. 

When 2 becomes indefinitely large R? =r and the regression coefficient becomes 
the sum of an infinitely large series of infinitely small terms. That this series will 
take a finite limit is clear; for example, if we take (#, — %,)/o, = constant = q, then 
~ TV ° ° ° 

9 — Ly = l = qo, in the limit. 
l+(n—1)r 


teturning to the case of predicting the probable value of a character from 


» ~( fo _¢ WN 
a knowledge of the ancestral values, we have r = Tp = ABP 1 =a=N2 p,. Thus 
IZ , 1 r Ps > 
PS V2 py Q X p X p 
F/ ieee! | = 0 4 4 > 


0 aoe = 
L+(n—1)V2p 1 ap 

9 | 
* J = gs “lip, 

> =o,V1— Rk, where RP? = “ . 

1+V72(n— 1) p, 

The previous remark holds good here, that as n becomes very large we have 

an indefinitely large number of indefinitely small terms. But there is no advantage 

I do not remember publishing this result before, but have given it occasionally in lecture to 


illustrate multiple correlation. 
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in taking n indefinitely large. When x = 10, excluding cousin marriages, there will 
be 2048 ancestors in the 10th ancestral generation and the “ mid-parent” provided 
by them will be essentially equal to the mean of that generation or X,=X,. 

We may note that in this case 


B\?-2 (| \P-o 
Pra pr( = pil } , 


V2) v2 
or the radix of the geometrical progression is ‘707107 *. 


If we take our system of equal correlations, the deduction of the probable 
character of one brother from n others, then o) =, for all values of p, and if 7 be 
the simple fraternal correlation 


i be nr (Ze) .) 
we we ein — Be n i 


S,-o,V1—R? 
nr" 


where R? = ee ee oy 


When x becomes infinite we have the array of brothers or sisters, which could 
form a family. But this must be the array of offspring from the same total 
ancestry, i.e. for n= 0%, 

* i/o hk: 
u = Iv’; 


or r= V2 p’. 


Here p, = p,/(1 + €). 


Taking stature, forearm and span, we find the mean value of € is ‘235667 +. 
The mean parental inheritance for the same data was 45925}. Hence p,' ='371,6616 
and V2 p, =°5256. The mean value of r the correlation coefficient in the corre- 
sponding tables for stature, forearm and span in pairs of brothers and in pairs of 
sisters was °5250§, which is sensibly in accord with the above result. The advantage 
of such a formula as that just given is that it enables us to deduce fraternal from 
parental and assortative mating correlations, or vice versa. The formula, however, 
essentially depends on 8 being nearly equal to unity, a result which would flow 
from the hypothesis that the associations of the mid-parents of each generation of 
ancestry to each other and to the offspring are of the same intensity. I now pro- 
pose to develop the formulae provided in this note numerically from the data 
given by my family measurements of 1893. The cards then issued were limited to 
four adult siblings in each family, and so in many cases it was only possible to 
measure one brother and one sister. Thus the pairs of brothers and the pairs of 
sisters were not as numerous as we could have wished, while we had ample material 


for parental inheritance and for brother-sister pairs. To illustrate points in this 


* For example the mean parental correlation coefficient being taken at :460 (Biometrika, Vol. 1. 
p- 378) we have *3253, *2300, -1626, °1150, -0813, etc. for the successive grandparental average corre- 
lations. These are by no means out of accord with our somewhat limited experience. 
+ Biometrika, Vol. 11. p. 373. 
t Ibid. p. 378. § Ibid. p. 387. 
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present note and in Mr Irwin’s preceding paper, I have confined myself to the 
single character of stature in pairs of brothers and pairs of sisters. Table I gives the 
differences in stature of 341 pairs of brothers and 482 pairs of sisters arranged in 
}” units*. I first took the differences given in this table, squared them and took 
their weighted mean. This had to be divided by 64, because the units are in 
quarter inches and because what we need (see p. 131) is the mean value of }(a,—22)°. 
In this way I found 

bo? (1— R*) 


I] 
— 
Ss) 


345, for brothers, 


l| 


1°597,267, for sisters. 
But o in inches was 6°85866 for 682 brothers and 6°94701 for 964 sisters. 
Calculating R*, from the above values, I found 
For Brothers: R? =°545,875; for Sisters: R? = 540,157. 

Accordingly, R = ‘7388 for brothers and =°7350 for sisters; these are the 
values of the multiple correlation coefficients for an indefinitely large number of 
brothers and sisters respectively. 

We can approach the subject from another side. The correlation coefficient for 
the 682 pairs of brothers was ‘5196 and for the 964 pairs of sisterst+ was *5370. 
Hence R=Vr gives us R =°7208 and ‘7328 for brothers and sisters respectively. 
These results are not bad, better in the case of sisters than of brothers ; 


we must 
bear in mind, however, that our value for the mean squared standard deviation of 


pairs depends on selecting at random from a normal distribution. 
TABLE I. 
Distribution of Differences of Stature between Pairs of Brothers and Pairs of Sisters. 


Frequency of Pairs. 





” F % | 
Differences in | 





0 | 7 ote) 2 5 6 ? 8 | 9|10| 11 | 12 | 13 | 
quarter inches 
Brother-Pairs | 11°5 | 24 | 29°5| 17°| 35°5 | 18°5 | 24°5 | 22°56 | 31 «| 15 | 18 | 10°5 | 18°5 i) 
Sister-Pairs 28 | 35 | 37 24 45 26 32 25 35°5 | 16 | 39 | 14°5 | 25°5 9 
Diff si 

l erences 1n 1h 15 16 17 18 19 H) | 29 23 2h 5 "} a 
quarter inches 
Brother-Pairs | 10 | 5 11 6°5 | 5°5 5 3°5 25 | 3 ig 2 1 
Sister-Pairs [185/15 | 17 |/8 | 6 ie a a ae { me 





Differences in 


28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 [Totals 
quarter inches 
: 
+z | 
Brother- Pairs 4 l — l 341 | 
Sister- Pairs l — 1 l — | l —\|— 182 


* T have to thank my colleague, Miss E. M. Elderton, for taking out this table from the original data 
cards and for finding several constants for me. 

+ The tables were rendered symmetrical, as there was no reason to distinguish for our present 
purpose between younger and elder siblings, so that they contained 682 and 964 pairs respectively. 
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We now proceeded to take the mean difference of pairs. This was found to be 
203391 for pairs of brothers and 2”-01686 for pairs of sisters, the two standard 
deviations being 2”°61890 and 2”°63572 respectively. 

Calculating r (= R*) from the formula * 


(1-7), 


T 


Mean difference of pews) 4 
( 8.D. of population 
we have 
ry = 526,289 for brothers, =*540,119 for sisters, 
the correlation table values being 


*=°5196 for brothers, =°5370 for sisters. 


The former values give 
R= :7255 for brothers and °7349 for sisters. 
Thus we have the following values of R the multiple fraternal correlation 


coefficient : 


Pairs of Pairs of 

Brothers Sisters 

From mean square S.D. of pairs ee ‘7388 ‘7380 

Irom correlation table value of r ian ‘7208 ‘7328 

From mean difference of brothers —... 7255 "7349 

Mean of three methods at Sa ‘7284 ‘7342 
Mean of results for brothers and sisters 7311 
Corresponding to an average individual) B48 


fraternal correlation ) 
[t will be clear from this numerical investigation that the value of the multiple 
correlation coefficient can be found with reasonable accuracy by taking the squares 
of the differences of brothers and of non-brovhers, and that 


Mean square difference of character in brothers -1-F 
Mean square difference of character in non-brothers ; 
Or, again we may use: 
Mean difference of character in brothers / - 
= V1] — R*, 


Mean difference of character in non-brothers 

The simple correlation coefficient may be found from the multiple correlation 
coefficient by the relation r = R?. 

We may now proceed to determine the constants of the multiple regression 
formulae on the basis of the values of the multiple correlation coefficients deter- 
mined from the mean square standard deviations of pairs, i.e. 

R? =°545,875 for brothers, = *540,157 for sisters. 

We have first, however, to determine p, and p,’ for stature. Taking the mean 

of the two parental correlations for stature, these give 


p, = 5040 for sons, and = *5085 for daughters. 


’ 


* «On Further Methods of measuring Correlation.” Biometric Laboratory Publications, tv. p. 5. 
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Now the coefficient of assortative mating in stature for this same material is 
e = ‘2804. Accordingly 
p: = p,/(1 + €) = "393,627 for sons, and =°397,142 for daughters, 
V2 p; = 556,673 for sons, and 561,644 for daughters, 
2p,2 =*309,885 - » ‘315,444 


From these values we find from the quadratic (p. 133) : 


For Sons For Daughters 
B = 999,759, B= 999,024, 
or, = ‘113,564, or, = 124,264. 
Now the individual ancestral correlations will be given by 
/ B\P> a _(Br . 
Pp= 5040 | K ) > Pp = “29085 | — ) : respectively. 
Nv 2. \v2 ; 


Or, by 
* py = 5040 ('706,936)?7, — p, =°5085 (706,417), 
or, py = 5040 (080,318)?—, pp, = 5085 (087,868)"—. 

The latter values for the individual ancestral correlations show results with a 
ratio of degradation far too small for our experience of grandparental and great- 
grandparental correlations. The second roots of the quadratic—those from the 
minus sign—must be discarded. Or, we conclude that 

8 =°999,759 for sons, and = *999,024 for daughters. 


But values obtained from statistical data and differing only so slightly from 
unity st 


ro 


iggest most cogently that the true value of 8 is unity, or that actually our 
multiple regression falls into the special case dealt with on pp. 134—135. The bio- 
logical significance of this result—applied to somatic character measurements—is 
very considerable. In statistical language we may state the principle as follows : 
The correlation of any “ mid-parent” with every other mid-parent in any ancestral 
generation and with the offspring is the same. Symbolically p,_,= 4, and is inde- 
pendent of p and q. 

Now a=V2),/{\8(1+€)} = V2 p, ='5567 for sons and ‘5616 for daughters. 

Had there been no assortative mating (e=0) we should have found 

a= °'7128 for sons and ‘7193 for daughters. 

Hence according to the intensity of assortative mating there may be con 
siderable variation in the value of this constant mid-parental correlation. 

The values obtained for mean individual ancestral correlations are as follows 
of course for stature: 


Offspring and Sons Daughters 
Parent... dha pid ime set 5040 “5085 
Grandparent ile he ne sae 3563 3592 
Great-Grandparent ane ae wer 2519 2538 
Great-great-Grandparent me <e ‘1781 1793 
Great-great-great-Grandparent ... < "1259 1266 


Great-great-great-great-Grandparent can ‘O8S90 ‘O895 
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We have no ancestral correlations for any accurately measurable character in 
man. Unfortunately in the case of the coat-colour in horses and eye-colour in man 
in the higher relationships lack of data did not permit of a separation of the sexes. 
The most we can do on comparison is to say that values more or less similar to the 
above have been found, but that until we have measurable characters in half-a- 
dozen generations, we can only assert that the above values for stature do not 
present any great improbability as to the rate of degradation of ancestral infiuence. 

Turning to the fundamental multiple regression equation for 8 =1 on p. 135, we 


note that for n ancestral generations : 


eae 
» 


1+(n—1)°5567") 3, 


- 5567 


7, — 1, = o> ” for sons, 


and ae 
si , 5616 at PO. Oe 
<< — in = We =e —.. § — ¥__ -” for daughters. 
1+ (n—1)°5616 , a» : 
“ l a 
Here Ld» = Cp ( - ) Vl+e 
\Vv 2 


= 113155 (70711) o, 


is the standard deviation of the pth mid-parents and a, is the standard deviation of 
male individuals in the pth ancestral generation. All female individuals are to be 
reduced to their male equivaients by multiplying their deviation from the female 
is the female standard 


/ 
where o, 


deviation. X,’ is the mean of all possible mid-parents in the pth generation thus 


mean of their generation by the factor ¢,/e,’, 
reduced. This definition of mid-parent and mid-parental mean is not an arbitrary 
one, but flows at once from the general principles of linear regression. For our 
particular data o, is 2’°61890 for sons and ¢,’ is 2”°63572 for daughters. While, 
as before stated, the substitution of =, would appear formally to give a divergent 


series, this is not so actually. Every X,—X, must be estimated in terms of its 
special standard deviation, and while =, gets smaller and smaller so does X,—X,. 


This flows from the simple fact that without special inbreeding it is impossible to 
obtain several hundred ancestors in the 7th or 8th generation all deviating by the 
same considerable quantity from the mean of their generation. It is easy to have 
two exceptional parents, but not easy to have a multitude of exceptional ancestors 
of a high ancestral order. For particular purposes it will always be adequate to take 
na simple digit in the above formulae. 

To some biologists this Note—if they consent to read it—may appear a mere 
juggling with figures, which no doubt we shall be informed are valueless as data 
because, it will be asserted, they were not collected by a biologist ! But at bottom 
there is a broad biological principle in the matter dealt with, which is really not 
beyond the comprehension of the non-mathematical. Let us take all the possible 
offspring of a human pair as a single group G,, the parents as a group G;, the 
grandparents as a group G, and the ancestors in the pth generation as a group G,. 
Then a study of the reduced variability of group G, as compared with the general 
population of their generation in the case of stature has shown us that if we treat 
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the groups G,, G,,... G,,... as individuals, these individuals are all equally asso- 
ciated with the ultimate offspring G, or with any other of these ancestral groups. 
It is precisely as if each group of co-generational ancestry including the offspring 
formed a single individual, and these single individuals like a group of brothers 
had the same degree of resemblance to each other. Is the individual more like his 
parents than his grandparents, or more like his grandparents than his great-grand- 
parents ? He is more like his individual parent than his individual grandparent, 
and more like the latter than his individual great-grandparent and so on (see 
p. 139). But if we drop the word individual, and replace it by the group as a whole, 
then the offspring resemble in an equal degree each successive ancestral group as 
represented by the appropriate “ mid-parent.” The individual offspring is as like 
to his grandparents, or indeed to his 2048 group of ancestors in the 10th preceding 
generation, as he is to his direct parental pair. Such is the conclusion we can 
draw, for at least one somatic character, by a full investigation of the multiple 
correlation coefficient of an indefinitely large group of brothers. 

I have endeavoured in this Note on Mr Irwin’s paper to express the ideas 
of which I was more or less conscious when I wrote my paragraph on the application 
of Galton’s Difference Problem to pairs of brothers in the summer of 1902. My 
wording was possibly obscure, but I was convinced that the suggestion contained 
more than Mr Irwin is now inclined to read into it, and I am very grateful to him 
for bringing me back to the study of my note on the subject in the first volume of 
this Journal. It was not till I had read it carefully again, that my ideas of 1902 
came back to me more clearly, and he has enabled me at last to put on record what 
I meant then by the paragraph which I venture to think he has misread. 








HEIGHTS AND WEIGHTS OF PATIENTS 
IN MENTAL HOSPITALS. 


By ROSA GREEN WOOD, CECILY M. THOMPSON anp HILDA M. WOODS. 


INTRODUCTION. 


In 1923 a Committee appointed by the Chairman of the Board of Control to 
consider the diets of County and Borough Mental Hospitals requested a number of 
Medical Superintendents to furnish particulars of heights, of weights on admission 
and subsequently of ages of samples of their patients. 

The primary object of this request was to ascertain whether any relation could 
be discerned between the weights of adult patients and the computed energy 
values of the standard dietaries of the institutions sampled. For reasons stated 
in the Report of the Committee this object was not satisfactorily attained, but, as 
the returns provided a larger mass of statistics of heights and weights than has so 
far been published in reports or papers upon persons treated in English Mental 
Hospitals, it was thought expedient to analyse the data as completely as practicable. 
A note on the chief results was printed among the appendices to the Report of the 
Committee on Diets. That note was written by Dr Major Greenwood, a member 
of the Committee, under whose general supervision this analysis was made. 

The inquiry has been conducted with the approval of the Committee on 
Quantitative Factors of Human Nutrition (Medical Research Council), and the 
expenses have been defrayed by a grant from the Medical Research Council. 


MATERIAL. 


Data of stature were furnished by seventeen mental hospitals included in the 
diet study and by another mental hospital not so included. Scrutiny of these data 
leads to the conclusion that they are reasonably accurate and provide a satisfactory 
basis of comparison. In the first mentioned seventeen Mental Hospitals the ages 
of the patients were also recorded. 

Data of weights are more scanty and considerably less accurate. All the seven- 
teen Mental Hospitals included in the diet study furnished statistics of (a) weight 
on admission, (b) weight at the date of making the record, but only six of these 
gave the weight without clothing and the estimates of the weight of clothing in 
the remainder were so conflicting that, save for one purpose, it was necessary to 


discard the statistics. The mental hospital not included in the diet study provided 


satisfactory records of weight, but not weight on admission. 





Rosa GREENWOOD, CECILY M. Trtiompson AND HiupA M. Woops 143 


In this paper we treat the subjects under the following headings : 

1. Stature of the Mental Hospital Population Compared with Other Mental 
Hospital and Normal Populations. 

2. Weights of Mental Hospital Patients Compared with the Weights of Normal 
Persons, taking account of Height and Age. 


3. Changes of Weight in Inmates of Mental Hospitals During Residence. 


STATURE. 

Excellent material for comparison has been fuily analysed by biometricians, 
notably Dr Tocher and the late Dr Charles Goring, whose work on the anthropo- 
metry of criminals is classical *. 

In Table Ia we set out our own results and in Table Id those of previous students. 
It will be seen on comparison of our results with those of Tocher that the agree- 
ment is good. Tocher distinguished a “General Insane” from an “ Entire Insane ” 
population, the former derived from the latter by exclusion of individuals “ con- 
sidered by their medical attendants to have characters affected by special causes 
not characteristic of insanity in general” (‘Tocher, p. 301)+. We have excluded no 
cases, so that our sample approximates more closely to the “ Entire Insane” than 
to the “General Insane ” of Tocher’s grouping. 

TABLE Ia. 
Mean Height with Probable Errors und Standard Deviations. 














| MALeEs FreMALES 
| 
No. of ob- | - Standard No. of ob . | Standard 
Asylum servations | Mean Height Deviation servations Mean Height } Deviation 
: | 
| T 182 66°27 +°148 2°96 183 61-79+°143 | 2°87 
| S 133 65°97 + °137 2°3 —_ =e 
a P 249 65°19 + °107 2°50 231 60°64 + °136 3°06 
ee ve 175 65°91 + "144 2°82 189 60°04 + °157 3°20 
|r 174 65°74 + °200 3°90 215 61°46+°148 3°22 
1G 146 65°70 + *123 2°21 183 62°27 +143 2°87 
A Q17 65°52 + 132 2°91 230 61°06 + °133 3°00 
ew | B 149 65°35 + °183 3°30 195 61°04+°145 3°00 
shit | N 126 65°37 +°179 2-98 152 61°02 + *130 2°38 
O 63 | 64°96 + *248 2°92 60 64°02 + *164 1°88 
(® 39 | 64°254+°519 | { 83 61°42 + °193 2°60 
Midland D 95 | 6453+°222 | 3 135 1989+ °158 2°73 
(R 44 | 64°98+-109 | 2 50 61°15 + °142 3°33 
South (K 61 65°00 +°195 | 9°95 95 62°06 + °233 3°37 
Western /?H 110 | 65°49+°182 | 2°83 151 60°80 + °185 3°37 
| 
South (J 156 | 65°62+°181 3°35 248 62°10 + °126 2°95 
Eastern IL 152 66°02 + ‘172 3°15 132 61°45 + °164 2°79 
All Asylums 2471 65°55 + 041 3°05 2732 61°27 + -040 3°10 


Goring, The English Convict. London, 1913 (Stationery Office). 
+ Tocher, ‘‘ Anthropometry of Scottish Insane,” Biometrika, Vol. v. 1906, p. 298. 
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It appears that the males of the English Mental Hospitals are very slightly 
shorter on the average and the females slightly taller than those of the Scottish 
Mental Hospitals. 

Tocher inquired whether the differences between the means of the twenty-two 
Mental Hospitals supplying his data were consistent with random sampling, and 
concluded that the inter-local differences were significant. 

We have used a method of analysis similar to that of Tocher (see Tocher, op. 
cit., p. 315 et seq.), and likewise conclude that there is inter-local differentiation 
(Table I). In the lower half of the table we set out the distribution of differences 


TABLE II. 
Differences of Heights from General Mean. 


MALES FEMALES 
Probable Ratio of S.D. Probable | Ratio of S.D. 
Asylum Difference error of of sampling Difference error of | of sampling 
difference to difference difference | to difference 
| | 
Tr + ‘72 1525 +3°41 + °52 1545 + 2°52 
Ss + *42 1783 + 2°09 
; P ‘36 ‘1303 — 2°37 63 “1375 — 3:27 
Northern ; 14 + 36 1555 +1°74 ~ 1°22 “1521 45 
F + +19 1559 + °68 + °20 "1426 oe 95 
G + ‘15 1702 + ‘82 + 1°00 "1545 | 6+ «24°85 
(A — 03 1396 16 20 “1378 — 1°05 
. lh —- “20 L685 = pas 23 *1497 | 11] 
Eastern 1 = iG 1832 ‘70 — 25 ‘1696 — 1°30 
lo — “59 591 1-62 +2°75 2699 | +1123 
je 1°30 3293 1°71 is) *2294 | + 53 
Midland /D° — 1-02 2110 3°17 — 1°38 1799 | 5°89 
LR — “57 1317 3-63 12 ‘1322 | - 60 
South  ¢ - 35 2633 — 1°91 + “79 2145 | + 2°33 
Western ,;.1 06 1961 2 17 1701 | ~ 1°77 
South {J 07 i646 + '27 + °83 327 | 163 | 
Eastern (1 17 1688 1-90 + 18 820 | + ‘76 
MALEs FEMALES 
P.E. Observed | Theoretical P.E. Observed | Theoretical | 
+1 7 85 +1 { 0 
+1—2 5 5D +1—2 4 52 
+2 5 2°9 +2-3 2 2°] 
+3 6 7 
Total | 17 Total 16 
| P=-408 P=-040 


Biometrika xv 10 
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as multiples of the Probable Errors in comparison with the frequencies of a Normal 
Distribution and the deduced values of P. The number of observations is, however, 
very small and little importance attaches to the values of P computed from so 
short a series. 

The range of mean statures is from just over 66:0 inches to 64°25 in males, and 
from 64:0 to 59°9 in females. 

The mean stature of male convicts is nearly the same as that of our male 
patients, and their range (see Goring, p. 195) similar to that found in the sample 
of Mental Hospitals. 

The explanation tendered by Goring to account for the inferiority of the con- 
victed criminal population in stature, viz. that they represented an unfavourable 
selection of the general population, may, we think, be applied with even greater 
plausibility to the case of inmates of mental hospitals, a proportion of whom are 
known to have suffered from defects of development. 

The concordance of our general results with those of ocher is such that we 
have little doubt in inferring that the constants here recorded do typify the 
“Entire Insane” population of English Mental Hospitals, and even less doubt 
in concluding that this population is physically select, its males being, on the 
average, shorter than their social and racial class. Of course the absolute difference 
may be small. There is (see Table Ib) a wide range of stature in normal populations, 
and it is probable—but not certain—that the population from which a majority of 
patients in public mental hospitals are drawn approximates in characters more 
closely to the Artisan Class of Roberts’ survey than to any other group. 

The occupational distribution of the patients forming the sample analysed was 
not given, but the Board of Control supplied us with the classification of the whole 
mental hospital population for 1908-12, which was compared with that of the 
General Population shown in the census of 1901. In his study of criminals, Goring 
employed the occupational classification of Roberts* and used the latter's standard 
population, viz. that of the census of 1871. We have not found in Roberts’ book 
any definition of his categories which would enable us to group the census returns 
of 1901 by his method, and we think that in any case 1871 is too remote from the 
epoch of our observations to be a very good standard. We have therefore re-grouped 
Goring’s data (obtained from the schedule of measurements which was separately 
published) and set out in Table III the three proportional distributions. 

Obviously neither the Convicts nor the Asylum Patients are random samples 
of the General Population. It must of course be remembered that a large dis- 
crepancy under headings XXII and XXIII would be expected, having regard to 
the fact that neither the Convict Prisons nor the Mental Hospitals will contain a 
normal proportion of persons between the ages of 10 and 20. 

With reference to other groups showing wide differences, both Convicts and 
Mental Hospital patients show larger proportions of persons engaged in Pro- 
fessional Occupations and in National Defence, and both show smaller proportions 


Roberts, Manual of Anthropometry. Churchill, London, 1878, 
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of miners and of textile workers. The Convicts show a defect of agricultural 
workers and of persons employed in the food, tobacco and drink industries; in 
each of these cases the Mental Hospital patients show percentages nearer to the 
standard. On the other hand, the proportion of Mental Hospital patients drawn 
from the Building Trades differs more from the standard (in defect) than does that 
observed among Convicts. 


TABLE IIL. 








| 
England and - Criminals | 
; ' Wales Asylum (Goring) | 
Occupational Group (Males) Pavuint Population 
| opulation 1908-12 Population 
Census 1901 ‘ a 1902-8 
Total population less children 
under 10 years = a 12,134,259 10,304 3,000 
Rates per 1000 | Rates per 1000 | Rates per 1000 
I General or Local Government... 14°] 11-4 3°3 
I] Defence of Country aaa dere 13°9 28°2 27°7 
Ii] Professional sks ee as 25°7 39°9 18 °2 
[LV Domestic... Eee Me sais 25°1 29°6 277 
V Commercial are sm —_ 13°7 57°8 717 
VI Conveyance of men, ete. — 102°9 93-0 90°7 
VII Agriculture oe a a 88°3 7571 29°7 
VIII Fishing ... eos ek sei 2°0 2°3 6°7 
[IX Mines Sai oe ee 65°9 16°5 48°7 
x Metals, Machines, etc. ... ae 96°8 63°0 73°0 
XI Precious metals, je wels, etc... 10°8 6°9 17°0 
XII Building ... ae wo 85°9 69°3 << 
XIII = Wood, Furniture, ‘etc. og ao 19°2 rz 3 
XIV_ Brick, Cement, Pottery, ete. ... LiF 19 5°7 
xv ‘hemical, Oils ae vas oo 8-4 1°8 5°0 
XVI Skin, Leather, etc St as 6°6 5°8 3°7 
XVII Paper, Stationery — se 15°5 14°2 9°0 
XVIII Textiles ... = ee ae 10°6 24°9 12°7 
XIX Dress aa es see ~~ 34°2 39°9 53°0 
XX Food, Tobacco, Drink ... sto 63°8 53°5 37°0 
XXI Gas, Water, ete. ... —— 59 1*4 33 
XXII Other general and undefined - 56°0 200°2 285°7 
XXIII Without — oc — or 
unoccupied = 163°0 L07°3 35°0 


A perusal of this table does not suggest that the occupational groups manned 
by the middle and upper classes are under-represented in the Mental Hospital 
Population, and therefore does not offer an obvious explanation of any inferiority 
of physique observed. Thus the same conclusion as drawn by Goring (op. cit., 
p. 178) holds, but less securely than in his case because we have not been able, 
as he was, to compare mental hospital patients and normal persons from the 
same occupational class. This is a task for the future, one thoroughly worth 
accomplishment. 

In Table IV we give the correlation coefficients between stature and age, and 
the correlation ratios of stature upon age for normal and mental hospital popu- 
lations. The differences are not such as to merit comment. 

10—2 








- - Mean Age Standard | Mean Height Standard Correlation Correlation 
. (Yrs. ) Deviations (inches) Deviations} Coefficients Ratios 
vations 
Air Force Candidates. 1238 | 23°92 +097 5°07 68°83 + °048 2°50 — 0339+ °019 1790+ °019 
Males. 
English Mental Hospital. 2471 | 41°94+°209 15°41 65°55 + ‘O41 3:05 0092+ ‘O14 *2298+°013 
Males. ‘ | - 
English Mental Hospital. | 2732 | 43°68+°199 15°42 61°27 + 040 3°10 -°0777 £013 | *2654+ 012 
Females. 
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TABLE IV. 


Correlation Coefficients and Ratios of Age and Height for Air Force and Insane. 











Bopy WEIGHT. 

The subject of change of body weight with age and stature is evidently one of 
considerable importance. Its importance in individual prognosis is, of course, a 
commonplace of medical experience, since a deviation from the individual’s norm 
affords valuable and early indications of disease. The value of a comparison of the 
weight of an individual with the standard provided by statistical averages of 
persons of his age, stature, race and social class is undoubtedly smaller, but the 
very extensive investigation published by the Medico-Actuarial Society of America * 
shows that even here importance can be attached to the procedure. It therefore 
seemed that results of interest would flow from an analysis of experience in Mental 
Hospitals. Unfortunately we have been hampered by two serious defects. The 
first is that data, even for normal populations, are less adequate than might have 
been expected; the second, that the measurements taken in the Mental Hospitals 
have been, on the whole, unsatisfactory. 

The literature of adult body weight is far more scanty and much less exact 
than that of stature; the weight of adult women has indeed hardly been discussed 
at all in England and Wales during recent years. Most of the statistics of body 
weights relate to clothed persons, and the weights of the clothing have often been 
estimated in an unsatisfactory way. In the case of the Mental Hospitals, the 
estimates of weight of clothing had a range from 7 to 14 lbs., and we were quite 
unable to accept these estimates as having any pretensions to exactitude. 

The best data regarding body weight in healthy adults to which we had access 
were measurements of 1238 Air Force candidates (Cripps, Greenwood and New- 
bold)+; these men were weighed stripped, and had a mean weight of 139-86 Ibs. 
Goring’s weights of 2500 convicts, weighed in trousers and shirt, are probably 
equally accurate. Their mean weight was 142°56 lbs., but their mean age was 

* Medico-Actuarial Society of America. Mortality Investigation. Statistics of Height and Weight 
of insured Persons. New York, 1912, Vol. 1. 

+ Cripps, Greenwood and Newbold, “‘A Biometric Study of the inter-relations of ‘Vital Capacity,’ 


Stature, Stem length and weight in a sample of Healthy Male Adults,” Biometrika, Vol. x1v. Parts 3 and 
4, 1923, p. 316, 
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36°24 years and the mean age of the Air Force candidates was only 23°92 years, 
while the statures differed by more than 3 inches. In general there is a positive 
correlation between weight and age, and between weight and height. Since the 
regression of weight upon age is not linear, the customary linear equations of re- 
gression are not exact methods of graduation, but they may be used to make 
rough allowance. Using the regression equation computed on the Air Force data, 
it appears that an Air Force candidate of the mean age and stature of the convicts 
would weigh 137 lbs. Allowing 2 lbs. (a minimum estimate) for the weight of 
trousers and shirt, we reach as the figure to be compared with Goring’s, 139 Ibs. 

All other data relate to clothed men. The 3000 measurements made by 
Hutchinson* eighty years ago had a mean of 147°86 lbs. The 5032 men, aged 
25 years or more, reported on by the British Association+ forty years ago, had 
an average of 159:2 Ibs. Lastly, Schuster’st weighings of undergraduates had a 
mean of 151:94 Ibs. (these men were weighed without boots or coat). 

Hutchinson’s subjects included persons between 15 and 40, all but fifty-nine 
of Schuster’s subjects were between 18 and 23, so that both samples might be 
expected to give lower averages than that of the British Association, which covered 
a larger proportion of older persons. 

The Medico-Actuarial Society of America data refer to 221,694 men having 
a mean age of 33 years and a mean stature of 68°5 inches. The average weight— 
that of the clothed man—was 156:2 Ibs. Levine§ has recently communicated 
the results of 28,697 examinations for life assurance in Great Britain and Ireland 
during the years 1921-22 (these, of course, are also weights of clothed men). 
From the graduated results, it appears that for a man of 36 standing 65} inches 
(the mean values for Goring’s convicts), the mean weight would be about 142 Ibs. 
This is very near Goring’s figure, and perhaps, since the candidates for assurance 
probably had more than shirt and trousers on, is evidence of a slightly lower 
mean weight. 

g, 
precisely what the average weight of a normal adult is. But, making allowance for 
stature, we think that the Air Force candidates were somewhat below the general 


Owing to the difficulty about clothing, we are really not in a position to state 


average or, alternatively, that predictions based upon the Air Force data—involving 
the assumption of linear regression—slightly under-state the position, if the British 
Association data are taken as a standard. Thus the average age of the males 
over 25 in the British Association’s experience was 349 years and the average 
stature 67°92 inches. Using the equation (deduced from the Air Force records) 
Weight (in pounds) = 0°7326 Age (in years) + 3°6133 Height (in inches) — 12637, 

* Hutchinson, ‘‘On the capacity of the lungs and on the Respiratory Functions with a view to 
establishing a precise and easy method of detecting disease by the Spirometer,” T'ransactions of the Royal 
Medico-Chirurgical Society of London, Vol. xxtx. 

+ British Association, Report 1881. Report of the Anthropometric Committee, pp. 242-3. 


+ Schuster, ‘‘First Results from the Oxford Anthropometric Laboratory,” Biometrika, Vol. vit. 
1911, p. 40. 


§ Levine, ‘‘ Heights and Weights,” Journal of the Institute of Actuaries, Vol, Liv. Part 2, July 1923, 
213 
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we predict for this age and height, 1446 lbs., or 14°6 lbs. below the observed 
weight. 146 lbs. seems an improbable figure for the weight of clothing. But on 
the Assurance Standard, the agreement. is close. Levine’s graduation would put 
the weight for this age and height at about 151 lbs., which only requires somewhat 
more than 6 lbs. to be allowed for clothing to coincide with the Air Force predic- 
tion. We do not of course know whether the older British Association figures or 
the modern assurance figures more correctly typify the normal average. If we 
accept the latter, there is very good concordance between the Air Force data and 
the assurance data, and, making allowance for difference of age and stature, between 
both and the weighings of criminals. 

Upon the whole, therefore, we seem quite justified in judging our mental 
hospital data for males with the help of the equation deduced from the Air Force 
records. 

Data of weights of women are, for England and Wales, extremely scanty. 
In Table V we give the results of the American Medico-Actuarial Investigation 


TABLE V. 
Means and Standard Deviations of Age, Weight, and Height for various 
groups of Women. 








Age (in years) Weight {in lbs.) 


)bserva- is Standard | Observa- <a Standard | Observa- Si in 
tions | © “NS | Deviations tions means | Deviations tions oe 

Medico- Actuarial 
as 136375 32°57 1 9°149 136375 | 134°196 19°428 136375 64°282 

Polytechnic Physical 
Training School... 232 20°52 2°29 226 124°38 11°95 226 64°78 
Somerset House 124 26°34 7°69 125 117°24 16°91 125 62°96 
| Bedford College 115 21°13 1°79 70 117°75 16°29 70 64°42 

Equation for Polytechnic Scholars : Weight =3-9072 height — 128-73. 


Equation for American Women : Weight=3'4978 height +°6437 age —111°6156. 


and three short series of English data. The difficulty about the allowance to be 
made for clothing is so great that any comparison is hazardous, but the following 
considerations suggest that the average weight of American women, allowance 
being made for height and age, is greater than that of English women. From the 
American data we have calculated the regression equation connecting weight, 
height and age and have applied this to the prediction of weights for the given 
mean ages and heights in three English series. We find for the Polytechnic women 
1281 lbs. compared with 1244 observed, for the Somerset House women 125°6 
against 117°2 observed, and for the Bedford College students 127-3 as against 
117°8 observed. The Polytechnic Physical Training School women were as a whole 
particularly vigorous subjects and, bearing in mind both the difficulty about 
clothing and the fact that the range of 


age 


covered was very small, we cannot 
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be confident that their mean weight is really much less than that of the 
Americans. But the difference in both the other groups is so large that it seems 


most improbable that the true mean of either reaches the American Standard. 


WeiIGuHtTs OF MENTAL HospitaL PATIENTS. 
In Table VI are given the particulars of weights for both males and females in 
six hospitals which supplied unclothed weights. 
In Table VII we compare the weights of the males with the predictions from 
the Air Force equation, allowing for age and height. 
TABLE VIL. 
Males. Means of Final Weights in Table IV compared with estimates 


from Air Force Equation. 


Asylum Observed weight | Predicted weight Difference 
| 
“ | : 2 

(< 138°S8 + 1°23 141°6 + °33 + 2°8 
| Northern 7 P 140°7+ °82 139°0 + °26 E77 
| iy 125°5+ -94 144-7 + 30 +19°2 
| , 
| Eastern N 136°1+1°33 141°6 + °36 + 5° | 
j | 
| Midland ‘Dp } 130°4+ 1°62 138°3+°88 + 78 
a: ae 132°8+ -82 136-9 + 26 + 41 
| 
| Whole Sample 134°4+ 42 140°7 +13 + 63 

The equation: Weight (lbs.) = 7326 age (years) + 3°6133 height (inches) — 126°37. 


In one instance, the observed weight exceeds the predicted weight, in the other 
five it is in defect. 

Confining ourselves to persons over 20 years, the six mental hospitals provided 
969 males and 1084 females. The biometric constants were evaluated and it 
appeared that the correlation of age with height was negligible. In Table VIII 
we show the mean weights for arrays of age together with the values assigned by 
the regression of weight upon age. 

When these are tested in the usual way for Goodness of Fit* it appears—as 
indeed is obvious from the table—that the fit (in males) is bad, but that this is 
due to great divergence in particular groups. Thus, in the case of females, 38°6 °/, 
of the value of x* is contributed by the last group, and its omission would increase 
P from ‘25 to ‘65 (n’ =14). 


In males the last two groups contributed nearly 
62 


of the value of y* and their omission improves P to ‘073 (for n’ = 13), 
and even in males the general trend is fairly well represented by a straight 
line. In other words, although the regression is certainly not linear, we may say 


that a comparisun of the regression coefficients of weight upon age between 


* Biometrika, Vol. 1x. 1913, p. 28. 
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equations derived from normal populations and from mental hospital populations 
is a justifiable proceeding. Such comparison shows us that the increase of weight 
TABLE VIII. 
Weights ut different Ages for 969 Males and 1084 Females in Six Asylums. 





Weights of Males (Ibs.) Weights of Females (lbs.) 


Years 
of age } 


No. of Ob- No. of Ob- 


pening Observed } Calculated | “~*~. Observed | Calculated 

servations | servations 
23 82 131-23 99 113°13 113°37 
28 129 131°21 116 112-02 113°65 
33 138 137°35 135 112-06 113°94 
38 125 130°93 133 114 87 114°23 
13 106 138°55 150 114°32 114°51 
48 112 138°29 128 117°05 114°80 
53 76 134°33 87 118°58 115°09 
58 70 141°54 77 117°59 115°37 
63 61 138°20 7i 113°42 115°66 
68 39 136°28 39 112°69 115°95 
73 23 137°86 26 116°40 116°23 
78 5 147°25 20 110°25 116°5S2 
83 2 105°25 | 2 110°25 116°8 
88 95°25 l 85°25 L1L7°09 


Weight = ‘0573 age + 112°0487 





Equations: Weight =*1917 age+ 


in LD nv 15 
a2 = 50°3658 «2? =17°0964 
P= :00 P=. °25 


with age is much smaller amongst the insane than amongst normal persons. While 
in respect of stature the insane are differentiated from the normal to the same 
extent as convicted criminals, the latter are not, in respect of weight, subnormal, 
the former are. The most important cause of this is, undoubtedly, that the popula- 
tion of a mental hospital contains many more physically diseased persons than 
a convict prison. Goring’s analysis of the vital statistics of criminals showed inter 
alia, that “the current allegation of criminality and tubercular disease being 
kindred manifestations of the same form of human decadence is entirely unsup- 
ported by the statistical facts” (Goring, op. cit., p. 229). 

In mental hospitals, on the other hand, the rate of mortality and the prevalence 
of tuberculosis are very high. We cannot indeed account for our low average 
weights by the presence in the samples of overtly tuberculous patients, the numbers 
so recorded were very few, but that many were in a pre-tuberculous state can be 
regarded as probable. This, of course, at once raises the extremely interesting 
question, which is cause and which effect, but the present data are neither numerous 
enough nor accurate enough to allow us to discuss that question; a further 
peculiarity of the material is dealt with below. Our demonstration of the low 
weight for age and stature of the population is, we submit, a cogent argument in 
favour of requiring that changes of body weight in mental hospitals should be 


made the object of much more exact inquiry. 
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We are not acquainted with any published investigations of changes of weight 
in the chief clinical types of insanity and, having regard to the enormous field for 
investigation offered by the mental hospitals of this country, lack of such informa- 
tion cannot be justified. It appears that contempt for very humble methods of 
research, mere weighings and measurings, has been carried too far. Perhaps careful 
anthropometry would suggest to the clinician something of as much practical 


importance as is to be gathered from much recent psychological speculation. 


TABLE IX. 


No. of | Mean Period | Mean Gain Percentage of patients | 
Obser- | Mean Age observed who lost | 


Asylum (years) 


of observation in weight 








vations (months) (Ibs. ) weight while under 
observation 
| | 
F M 174 12°] 11°8 6°54 20°7 
F 215 16°3 11°2 2°30 38°6 
G M 146 ey 12°2 3°66 26°0 
KF 183 17°3 10°7 1°10 30°1 
M M 175 35 13°9 1°27 28°0 
: F L189 $1°7 16°3 6°50 26°5 
Northern P oM 49 10-7 18-6 1 25-7 
Fk 231 39°6 16°2 7°40 25D 
S M 133 13-4 14°5 4°33 31°6 
F = = 
: 8 13°0 12°8 3°97 33°5 
{ i" 3 $1°7 13°9 3°60 29 °O 
A M 217 $1°7 12°3 5°49 30:4 
F 230 12.°9 15°1 9°10 22°6 
BM 149 12°] 13°7 7°28 22°1 
E: F 195 13-2 13°9 8-90 241 
ae ls 126 13-2 12-0 3°39 38-9 
F 152 44°3 12°4 — 20 45°4 
O M 63 17°O 19°3 2°48 33°3 
I 60 15°3 13°9 1-90 28°3 | 
D M 95 13°2 12°2 11°15 15°8 
| F 135 14-0 11° 2°70 36°3 
, E M 39 182 13°3 11°00 179 
Midland F 83 18-5 12°3 6°40 265 
|r M 244 38°8 13°7 7°95 21°3 
F 250 11°5 17°3 5°40 27 °2 | 
H M 110 12°3 12-2 1°23 159 
South F 151 15°8 10°5 5°40 L8°5 
Western K M 61 $2°5 10°2 1°52 27°9 
{ Fr 95 14°9 11°7 7°00 24°2 
| 
J M 156 t1°4 11°6 3°27 34°6 
South F 248 14°8 13°9 — 60 16°8 
Eastern | L M 152 10°4 11:7 3°59 36°2 
F 132 14°3 13°1 6°10 27°3 
Weighted § M 2471 12°0 13°4 5°60 27 °2 
Averages | F 2732 13°7 13°7 1-90 30°3 








THE CHANGE OF WEIGHT OF PATIENTS BETWEEN 


ADMISSION AND DATE OF OBSERVATION. 
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In the above sections we have used the weights at the date of completing the 
records, and these were at various times after admission. 


The subject of the change in weight of patients during residence in an asylum 


is a matter of considerable interest. 
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TABLE X, 


Weight of Patients between Date of Admission 
and Date of Inquiry. 


Gained Weight 


Mean 


first 
weight 


134°5 
105°6 


Mean Mean No. who 
gain final lost 
weight 
11°28 | 134°2 36 
12°39 | 112°7 82 
11°37 143°4 38 
11°12 | 122°8 5D 
11°21 143°1 5O 
13°19 | 120°5 50 
14°36 | 144°1 59 
15°09 118°6 59 
9°18 | 139°4 3 
10°80 | 128°] 61 
8°37 112°4 53 
11°80 | 133-2 66 
15°29 | 120°6 52 
12°89 138°0 33 
15°47 | 120°3 17 
12°74 140°9 19 
11°03 | 121°4 69 
10°22 | 139°0 21 
10°65 | 124-0 18 
15°38 | 131°9 15 
13°14 | 115-4 49 
16°73 | 134°8 7 
12.89 | 125°8 22 
13°00 134°5 52 
15°69 | 122-0 67 
ll*1l | 138°6 29 
9°87 119°3 28 
10°48 | 143°8 17 
14°61 126°1 23 
11°16 142°5 54 
10°54 | 119°1 115 
10°99 | 145°5 56 
12°59 118°2 35 


Lost Weight 


Mean 
first 
weight 


132°0 


120°1 
138°9 
124°8 
145°6 
118°3 
140°9 
121°6 
132°5 


) 


Mean 
loss 





Mea 


n 


final 
weight | 


123° 
109° 


131 
117 
34 


108° 
131°$ 
109° 


127° 


Tables IX and X we show the difference 








156 Heights and Weights of Patients in Mental Hospitals 


between the weight on admission and the final weight in all the institutions 
making returns, including of course the six providing what we think are accurate 
weights. The others were included here because although, owing to the difficulty 
about the weight of clothing, the absolute values are useless, it is probable that 
the method of weighing in any one institution is constant so that the differences 
between initial and final weights may have some value. Even this, however, is 
open to question, because some of the weighings were in the summer and some in 
the winter, so that differences between summer and winter clothing are possibly of 
importance. There is, however, no uniformity in the seasonal distribution of initial 
and final weighings, so that such an error is randomly distributed. 

It is very important to notice that the patients weighed are a peculiar selection 
of the inmates. The superintendents were asked (on May 30th, 1922) to supply 
the information* “concerning the first 250 male and 250 female patients admitted 
since January Ist, 1920, in order of their admission, who are still resident in your 
Mental Hospital. Should the number of admissions since the date mentioned be 
less than 250, the actual admissions will meet requirements.” Consequetitly—the 
average time of residence of the patients measured being 13 months— -there 
are excluded from the sample (@) persons who died within the average period of 
months from admission, (b) persons who were discharged cured within that period. 
We are therefore wholly dealing with persons suffering from some mental disorder 
of a relatively chronic type not accompanied at admission by bodily disease of 
a rapidly fatal type. 

For the reasons stated we can, so far as the majority of the institutions are 
concerned, merely express the following prima facie deductions from Table X. 

(1) There is no relation between scheduled diet and body weight. 

(2) About 30 per cent. of the sampled population lose weight between admission 
and the date of final weighing, which is from 12 to 18 months after admission. The 
mean increases and decreases shown in Table X are in several instances large, but, 
bearing in mind the very serious criticism to which the methods of weighing are 
subject, no stress is put upon the precise figures ; they do but indicate the need of 
a wider and more accurate survey. 

(3) This result cannot be explained by the existence in the mental hospital 
population, as sampled, of a supra-normal proportion of persons suffering from 
bodily disease—such as phthisis—notoriously associated with loss of weight. 
Giving a very liberal interpretation to the terms physical disease likely to lead to 


loss of weight, we find that the proportion of patients in our sample belonging to 


this class is less than 5 per cent., and simple calculation shows that dilution of 


normal material with 5 per cent. of persons even enormously underweight would 


not bring down the general average as low as it is found to be. 


(4) A plausible explanation is that many victims of acute psychoses are known 


to lose weight rapidly and that this sample will include a relatively large propor- 


Extract of official letter of Board of Control. 
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tion of such cases. On the other hand, the circumstances of persons of unsound 
mind coming from the economically less prosperous classes are often very unfavour- 
able, and such persons would tend to put on weight when cared for in a public 
institution. These two considerations tend in opposite directions, qualitatively ; 
what their respective values are we cannot tell. 

For males of the average height of our asylum patients, viz. 654 inches, the 
increase of weight for each quinquennium of age ranged from 3 to 5 lbs. between 
45 and 20 and was rather less than 2 lbs. at older ages. We have seen that in the 


Air Force data the approximate increase for each year of age—stature constant 
was rather less than } Ib. or somewhat more than 34 lbs. for a quinquennium. 


This is a finding in good agreement with the Insurance Offices data. 


Since the correlaticn of age and stature (adults) is very slight, the position is 
sufficiently well brought out by the accompanying Diagram 1 which shows the 
means of arrays of weights for quinquennial ages. 


Weights at different Ages for Males and Females in sia good Asylums. 
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Diagram 1 


Whether this divergence from the normal experience has a definite biological 
meaning or whether it is due to the unsatisfactory character of the material we are 


unable to say. 
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We think, however, that the results of this inquiry are sufficient to emphasize 
the need of a much more searching investigation of changes in weight of patients 
in mental hospitals than the present material has permitted us to make. 


It is possible that the simplicity of the measurements has caused them to be 
despised as objects of scientific research, and certainly no one would claim for them 
a high rank in the appraisement of clinical conditions in individual prognosis or 
treatment. But it would, we think, be of some interest to ascertain whether the 
anthropometric characters of different clinical groups within the population of 
mental hospitals and the variations of such characters with age are significantly 
differentiated. 





MISCELLANEA. 


Some further Notes on Cancer and Goitre Distributions. 
By PERCY STOCKS, M.D. 


IN a previous paper* evidence was collected which indicated a relation between the frequency 
of goitre and the incidence of cancer of the digestive organs, more especially the stomach. Since 
the publication of that paper some additional evidence of the same kind relating to the geo- 
graphical distribution of these diseases in America and Sweden has been obtained, and is here 
set down. 


l. More Comple te Analysis of American Statisties. 
During the Great War some 25 millions of recruits called up for service in the U.S. Army 
were subjected to a careful medical examination of which the records have been analysed in an 
official volume t. From these statistics the number of men per thousand examined from each 
State who exhibited various diseases can be ascertained. As regards anomalies of the thyroid 
gland the disorders included are “ goitre (simple),” “ exophthalmic goitre ”and “cretinism.” The 
rates for the last are too small to be reliable for the purpose in view. 
The distribution of exophthalmic goitre is found to be highly correlated with that of simple 
goitre, the correlation coefficients between the frequencies in the various States being : 
For the whole area... ... ... ... ... r="6744 054 
For 37 States east of Long. 104° W. ... 7=°830+ 031 
For 35 Registration States r= "749 + -043 
This high positive correlation is dificult to reconcile with many accepted views as to the 
causation of Graves’ disease, and is in contrast with the infrequency of this disease in goitrous 
areas of Switzerland. The question arises whether the dividing line between simple and exoph- 
thalmic goitres was the same in all the States, or whether in goitrous areas the Local Boards 
may not have looked more carefully for slight indications of hyperthyroidism and therefore 
classed a larger percentage of all goitres as exophthalmic than was the case elsewhere. To test 
this point we may separate the rejected men and those examined by Local Boards from the 
accepted men examined by Medical Boards in camps where the effect of any such local diagnostic 
peculiarity would be less: the correlations are, for rejected men += °611+°062, and for accepted 
men 7=‘689 + *( 52. from which it appears that the local factor as regards diagnosis is not of any 
importance. The correlat ion between Graves’ disease and simple goitre distributions in the U.S.A. 
must therefore be accepted as a rather surprising fact, and it rather suggests that two distinct 
factors may be necessary to produce Graves’ disease, viz. (i) a goitre-producing factor whose 
intensity is indicated by the prevalence of simple goitre, (ii) another factor, probably of nervous 
vigin, which is only present to a slight extent in Switzerland, but is more pronounced in the 
U.S.A. and almost universally present in England and Wales. This, however, is mere speculation 
| have indicated in Table I and on Map No. 1 the total rate for simple goitre in each of the 
American States (i.e. number of men per 1000 examined by Medical Boards, whether rejected or 


accepted for service, in whom simple goitre was noted). In the Report of the 1900 Census the 


* « Cancer and Goitre,” Biometrika, Vol. xv1. Parts 3-4, Dec. 1924, pp. 364-401. 
t ‘‘ Defects found in Drafted Men.” War Dept. Statistical Information, Washington, 1920. 
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TABLE I. 
Goitre and Cancer Distribution in U.S.A. 


1900 









































92. 
| Exoph- i | 
Gerre| Goitre | No- of deaths ——_| Selection | ence ail 
State | per | per l rate of | Percent. Percent. Per- : 
| 1000 | 1000 Cancer, | Cancer _ males Population _tmales cent. Cancer 
males | males |Cancer|Cancerof| organ | for in popu- in popu-| over — stomach | 
qa) | @) total | stomach not | stomach lation lation 45 ota and liver| 
| | | stated (2 | (3) (4) } 
| Alabama nae | “D6 79 340 18 | 89 | 1971 | 50°71 2,348,174 | | = | | 
| Arizona and New | | } | } | 
Mexico ea |) EA ‘91 43 | 9 | 10 | 27° 5 694,512 | 
Arkansas 40| 35 | 171 31° | 63 | 28-7 | 51-5 | 1,752,204 | 
California ... | 4°45] 1°95 1026 295 | 271 | | j5°2 3,426,861 52°9 26°1 12°7 | 
Colorado... ...| 5°29] 3°60 | 184 56.| 58 | 4) 54-7 939.629 | 52:4 | 21-9 | 37-9 | 
Connecticut he’ 89 153 | 555 95 | 158 9 50°0 1,380,631 50°4 22°2 44°6 | 
Delaware ... — 59 | 79 75 | 9 | 12 | 3 | S10 223,003 51°0 93°7 tis: | 
District of Columbia | 1°39 | 3°47 — | | $37,571 46°5 22°8 10°5 
Florida “| 25| 66 83 | 19 38 | 42-2 52:1 968,470 | 51°] 19°4 23°9 | 
|Georgia ... ...{ 52] 1°61 373 30 141 | 129 | 49°8 | 2,895,832 — 
Idaho bes .-. | 26°91 | 3°20 28 | 9 | 5 39°] 57°7 131,866 | 
| Illinois... | 779] 5:57 | 2102! 676 | 542 | 43°3 | 51°3 | 6,485,280] 50°9 | 21-6 | 14-7 
Indiana... —... | 649} 312 | 1036} 214 | 441 | 360 | 49°9 | 2,930,390] 50°8 | 24°8 37°8 
| lowa BIS ... | 6°68] 2°86 801 | 273 | 253 19°38 | 51°8 |} 2,404,021 } — 
Kansas... ...| 1:25] 2°76 | 522 143 199 | 44-2 52°3 | 1,769,257] 51:4 | 22-4 | 68-2] 329 
| Kentucky ... .. | L4l] 1°38 | 548 | 126 205 | 36°4 | 50°8 | 2,416,630 | 50°8 | 19°9 | 586] 27:1 
| Louisiana ... 62 | 7 | 316 64 | 90 | 28:3 | 50°3 | 1,789,509 50°5 16°1 60°9 22°9 
Maine 66 86 | 581 158 125 346 | 50° | 768,014 | 50°6 28°1 | 95:0 13°4 
Maryland 94.) 3°57 548 164 125 38°8 | 567 | 1,449,661 | 50°3 22°3 | 89:2 12°4 
| Massachusetts 32] ‘63 | 1864 347 | 738 30°8 | 48°7 3,852,356 | 19°] 24°2|101°6| 40°5 
| Michigan .. | 11:43] 6°89 | 1404] 411 | 338 | 38°6 | 51°6 3,668,412 | 52°6 | 21°7 | 82°0| 42°0 
Minnesota ... a | oe 3°38 | 659 | 251 | 164 50°7 | 53-2 2,387,125 | 53°4 20:7 | 97°2 54°0 
Mississippi wwe | 64 80 | 258 | 39 | 104 253 | 504 | 1,790,618 | 50-1 | 16:3 | 46°5| 17-4 
| Missouri ...| 3°99| 4°80 908 229 251 34°9 d1°4 3,404,055 50°6 93°1 | 711 35°2 
| Montana ... eos | BE GO 3°39 55 22 7 | 15°8 61°6 548,889 | 54°6 18°4 57°5 | 272 
| Nebraska ... ...| 2141 1°70 260 | 82 | 144 | 45°83 | 52°9 | 1,296,372 | 51°9 | 20° | 78:2} 43-1 | 
Nevada bas ... | 638) 1°52 | 14 4 1 | 10°O 60°5 77,407 —_ | | 
| New Hampshire ...|  °70 | ‘94 | 275 16 156 | 38°7 19-9 | 443,083] 5071 | 28:5 | 90°0| 39°1 
| New Jersey 13) 154 | 949 260 | 243 | 36:8 19°9 | 3,155,900] 50-4 | 21-1 | 862] 39° 
| New York... 119 | 2°94 1448 L097 176 | 33°5 49°7 10,385,227 49°7 22°5 | 94°8 12°8 | 
North Carolina 1-81 | 1°91 | 350 | 17 124 | 20°8 19°5 | 2,559,123 | 50°4 16°] 48 °7 20°7 | 
| North Dakota 873| 341 | 50] 15 i9 | 48-4 | 556 | 646872|] — | 
| Ohio wy 5°59 | 5-08 | 1943 190 745 10-9 0°6 | 5.759.394] 51°3 | 23-2 | 82:0] 40-0 | 
| Oklahoma an ‘87 19 | 14 12 37°'8 53°8 2,028,283 | os an 2 
Oregon 6°37 | 143 52 36 | 486 16°3 783,389 | 53:1 | 24-4] 82°5| 39-4 | 
Pennsylvania 4°81 | 2607 | 688 746 | 36°9 50°9 8,720,017 | 50°8 | 20°99 | 83°5| 41°9 | 
Rhode Island 55] 1°24 | 267 | 57 44 | 25°5 19°] 604,397 19 °2 23°3 90°7 | 37°1 | 
| South Carolina... 94) 1°73 | 231 | 18 81 | 12°0 19°6 | 1,683,724 | 19°8 14°4 44-2] 181 | 
South Dakota ... | 4°09} 3°18 77| 2 | 2 | 41 | 53-3 | 636547 | | 
Tennessee ... “8 1-96 | 2°07 147 66 161 | 23°] 5O*D 2,337,885 | 50°2 | 186 | 51-1) 17°8 | 
| Texas eal sco Bae ‘35 178 108 | 165 | 34°5 | 51°8 | 4,663,228 | } 
| Utah eee | 17°52] 3°82 68 | 20) | 27 48°8 51:2 | 149.396 | 51°6 16°6 66°9| 39:0 | 
| Vermont | 214) 4°28 | 281 | 60 115 | 3671 50-9 | 352428 | 50:8 | 28°5 | 96-4] 43-9 | 
| Virginia ...| 3°38] 3°99 | 461 91 148 | 28°7 | 49:9 2,309,187 | 50°6 | 185 | 61:8} 25°9 | 
| Washington ... | 23-40} 9-42 | 185 17 67 | 39:8 | 58-7 | 1356621} 54-2 | 22-7 | 778| 40-5 | 
| West Virginia ae | 7°89 1°52 | 188 | 54 | 50 39°1 | 52:1 1,463,701 | — — | — 
| Wisconsin ... ... | 14°02 | 7°94 | 980 313 | 390 53°1 51°6 2,632,067 51°5 | 22-1 | 85:0 Ol | 
Wyoming ... - | 15°38 | 2°71 | 16 | 6 | 5 54°5 | 62°9 194,402 | | — | 
(1) Total rate from all recruits examined, whether accepted or rejected. 
(2) Corrected by omitting from total cancer deaths those where organ was unspecified. 
(3) Corrected for age and sex distribution by approximate standardization from data in preceding columns, 


(4) Corrected for cancers of unspecified site, and for age and sex by same methods as (3). 
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number of deaths during that year from cancer of all organs and for cancer of the stomach were 
recorded for all the States, but in subsequent years mortality figures were confined to a limited 
number of “ Registration States.” In the 1920 report these number 35, and in these more recent 
records, cancer of the stomach and liver are grouped together. In 1900 the statistics for many 
of the sparsely populated Western States were of somewhat doubtful value and I have therefore 
correlated goitre frequency with the cancer factor in this year for (i) the whole area, and (ii) all 
States east of Long. 104° W., and for the year 1920 for (iii) the 35 Registration States. 

The actual mortality rates from cancer depend so greatly on the proportions of negroes to 
whites in the populations of the various States that they do not afford a true index of the real 
incidence of the disease, but the selection rates of cancer for any one organ (i.e. ratio of cancer 
of that organ to cancer of all organs) are less liable to distortion from this cause. I have only 
attempted to use actual death-rates for correlation in the case of the 1920 mortality data. 

A second source of error is the variation in sex-ratio in the populations of States, the north- 
western States being characterized by a preponderance of males. Since cancer of the stomach 
forms a larger percentage of male cancers than of female cancers, the selection rate of cancer for 
the stomach in the whole population is positively correlated with the proportion of males. The 
mortality statistics do not separate the sexes and I have therefore attempted to eliminate this 
source of error by partial correlation in the 1900 data, whilst the 1920 mortality rates for cancer 
generally and for cancer of stomach and liver have been independently corrected for differing 
sex-distribution in the usual way before correlating. 

A third source of error, which may also affect the selection rate as well as the actual death 
rates, is differing age distribution, and an approximate standardization of the deaths has been 
carried out for 1920, again treating cancer of all organs and cancer of stomach and liver separately. 
In this way a selection rate for each State was obtained for 1920 which had been corrected for 
unclassified cancer deaths and also in a rough manner for differing age and sex distribution. For 
this year it was also possible to apply Professor Pearson’s method of correlating actual numbers 
of cases for constant population in order to eliminate possible spurious correlation. 

The principal cancer statistics used for obtaining these correlations are given in Table I, and 
the results are given below. For the sake of comparison correlations have also been worked out 
with the total tuberculosis rate in recruits, the diabetes rate in recruits, the diabetes death-rate 
in 1920, and the exophthalmic goitre rate in recruits. 

(a) For the whole area*, correlating goitre rate in men with selection rate of cancer for the 
stomach obtained from 1900 statistics (correcting for cancer deaths where the organ was un- 
specified 

r='509 + ‘073, Ne.« "759. 

This is clearly not a linear regression, but it becomes approximately so on omitting the 
11 Western States with density of population less than 10 to the square mile, in which it is 
suspected that diagnosis was not so reliable (see 8). The fact that the value of 7 then becomes 
larger suggests that there is a real relation which tends to be obscured by faulty statistics. 


For comparison the following correlations over the same area may be noted. 


Tuberculosis rate with goitre rate ...0 0.0.0 60.0 cee eee eee P= — 290+ 090 
99 » With selection rate of cancer for stomach 7 - "244+ ‘092 
Diabetes rate a a "4 at ” ye 254 + 092 
9 o>  @ @eite rete ns, ace ce as es Oe 

9 » exophthalmic goitre rate ...0 0... 0... 7 ‘281+ 091 


8) For the 37 States east of Long. 104° W., correlating goitre rate in men with selection rate 
of cancer for the stomach obtained from 1900 data as before 
7=°64 t+ 065, n 


= "685, 


47 areas, combining Arizona and New Mexico owing to their yery small figures, and excluding 
Indian territory, Alaska and District of Columbia. 
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Since the regression now approximates to linearity I have endeavoured to eliminate the 
effect of differing sex distribution by partial correlation ; and also to test whether the exophthalmic 
goitre correlation with the cancer factor arises from its own correlation with simple goitre. 


Selection rate of cancer for stomach with percentage of males in population r= *534+°079 
Goitre rate with percentage of males oat fees wee 0s ees, eee SS | SES eee 
Selection rate with goitre, percentage of males being constant ... ... ... r= 6394052 


‘ : , . : r= B8+'0/5 
Selection rate with exophthalmic goitre rate ... / { in 


"Nc. = "604 


oi (fr = *830+°034 
Goitre rate a és = aor Sue? Naiie as Emer” alot) ieee aaa ae 

ln, « = 896 
Exophthalmic goitre rate with percentage of males... ... r= ‘28747102 


Selection rate with exophthalmic goitre rate, percentage of males pene" slnagie 
‘ oats Soe. oes, MSS Gane a 
Selection rate with goitre rate, percentage ee males ll e sisieihileidlialis goitre 

rate being constant Soer= aes east ual ape Pees “meee -<o0 coe cee oR |6(°4AT HE OCD 


goitre rate being constant ... 0... 0... oe 


The last two figures suggest that the correlation between exophthalmic goitre and the cancer 
selection rate arises entirely from the relation between exophthalmic and simple goitre dis- 
tributions. For comparison the following have been computed, percentage of males being constant. 


Tuberculosis rate with selection rate of cancer for stomach += —*268+:°103 
Diabetes rate “ - 3 is = r 215+°106 
= ~ »  goitre rate ... ... os. eee vese ose Oem ‘SEE Ge cee 
o = »  exophthalmic goitre rate wee sce «= 8894-105 


It may be noted that a positive relation between diabetes and total agi ogy for the 
American cities was demonstrated by Dr Maynard in this Journal (Vol. vir. p. 276)*. A signi- 
ficant relation between diabetes and exophthalmic goitre would be of interest in view of recent 
claims as to the beneficial effect of insulin treatment in Graves’ Disease. If the diabetes death- 
rates for 1920 are correlated with those of exophthalmic goitre, however, r="148+°108, and 
with those of simple goitre ,="025+°111, neither being significant. 

(y) For the 35 Registration States, correlating goitre rate in men with cancer mortality 
statistics for 1920, after correcting for age and sex distributions as well as for cancers of un- 
specified organ. 

(i) Goitre cases, estimated, with deaths from cancer of stomach and liver, 

corrected . wee! ane Sud wid Gc ee Weems ee) Sees” RGR er, peal, en 
Ditto for semihiiiinn I a. ae ane Mat ance Bow me —_ ... r="438+ 092 


) Goitre rate with corrected mortality rate from cancer of stomac ny ik ii r 





for total cancer death-rate constant ...0 0.0 66. cee eee wee 516 +084 





iii) Goitre rate with selection rate of cancer for stomach and liver vce eee «6 = "461 + 000 

It will be noticed that the three methods of correlation give results of the same order. 

These figures seem to indicate that there is evident in the American States as in Switzerland 
a significant relation between goitre frequency and the tendency of cancer to attack the stomach. 
In view of the more complex factors involved and the dubious value of the mortality statistics 
for some of the States this conclusion is put forward with due reserve, but the correspondence 
between the results obtained in such widely differing countries is remarkable. 

II. Gottre and Cancer in Sweden. 

Through the kind cooperation of Mr R. Moosberg of Stockholm I have been able to obtain 
statistics of investigations on goitre frequency in the two counties where goitre is most prevalent. 
In 1892 a survey of school-children aged 8-16, of confirmees aged 16-18 and of recruits for 
military service aged 21 and over was carried out in the Departments of Kopparberg and Gefle- 
borg. In the former county as a whole 346 children out of 2221 examined were found to be 


* For a criticism of Maynard’s result: see Biometrika, Vol. x1. p. 191 et seq. 


11—2 
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goitrous (15°6 %), 309 confirmees out of 1624 (19 %) and 109 soldiers out of 677 (16 7%). A more 
recent survey in 1910 of the district of Husby in the same county showed 15-9 % of males and 
33°6 7% of females to have goitre. Other important endemic areas in the county were the districts 
of Bjursis, Sundborn, Stora Tuna and Sviirisjo. In the Department of Gefleborg goitre was 
found to be endemic in a few areas, but for the county as a whole the percentage of recruits 
affected was only 1°3 % and of children 12°2 %. The most goitrous districts were Ockelbo with 
14% of children and 30 ¥ of adolescents goitrous, and Bjendker and Alfta with lower rates. A 
later investigation in the Hégbo district showed 5 % of men and 30 % of women to be goitrous. 


[ have shown in Table IT the total cancer deaths, deaths from all causes, deaths from unknown 
causes and corrected cancer mortality rates for the year 1911 for (i) the whole county of Kop- 
parberg, (ii) goitrous areas of Ockelbo, Bjenaker and Alfta of the county of Gefleborg, and 
iii) rural areas of Sweden as a whole. 


TABLE II. 
Deaths from 


‘ | 
Jancer ! rrected | 
Cancer All baa Corrected | 


Population 


Area deaths deaths cancer | 
1911 cause 
1911 | 1911 1911 death-rate | 
Department of Kopparberg ... 235,427 | 240 | 3067 240 
Goitrous districts of Gefleborg 19,159 20 237 22 
Total goitre areas ai eae 254,586 260 3304 262 110°9 
Sweden (rural ul = 1,159,216 96°7 


It will be seen that the goitrous areas had apparently a somewhat higher cancer mortality rate 
than rural Sweden as a whole. It is unfortunately not possible to obtain more extended data. 


Pir. Exophthalinie Goitre in England and Wales. 


In view of the interest which is at the present time being aroused on the question of goitre 
in England and Wales, [ have thought it advisable to include with this paper a map (No. 2) 
showing the standardized mean mortality rates from exophthalmic goitre over the 10 years 1913 
1922 in the various counties (excluding county boroughs) which has been compiled from the 
statistics of the Registrar-General. For explanation of the method used in standardizing reference 
should be made to Section VIII of the original paper in Biometrika, Vol. xvi. pp. 392—398, 
Dec. 1924. 


Recurrence Formulae for the Moments of the Point Binomial. 
By RAGNAR FRISCH, Kristiania. 
Introduction. In Biometrika, May 1924, Professor Pearson has given a very important recur- 


rence formula for the moments of the hypergeometric series. In the special case of the point 


binomial (p+q)" Pearson’s formula may be written 


a=2 Jp 9 8-2 /g—] F 
Bes rep 3 \ ; dei AA | ; ) mint peecsseetouctisweseues oececceee (1) 
‘s—1 2 (s—1 \ s-1) 
or BoA { 1 )e it = \4 \ ;) n(5a 1 ) Jo. GAO, .vccgeccccccsecsese 2), 
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where the moment of order s is defined to be 


Ps= Ps\ 7) = = (v —7q)* Ty, 
v=0 


where T= (") Se, DEL ccccacescses hivcsateeanemee rena (3), 
v) 
Po i, 
1 =0. 


In the following lines I first deduce some general properties of linear equations between the 
moments of the point bindmial. I then proceed to prove a system of recurrence formulae, a 
special case of which is the formula (1). Lastly I generalize the recurrence formulae of Pearson 
and Romanovsky * for the case of incomplete moments. The proofs of the last-named generaliza- 
tions are based on principles entirely different from those originally used by Pearson and 
Romanovsky. In fact only elementary summation operations are involved. By the help of these 
operations the proofs may be given an extremely simple form. The formula of Romanovsky in 
fact arises with hardly any calculation at all. 
$1. Some General Properties of Linear Equations between the Moments of the Point Binomial. 
A function #’(#) which identically in satisfies one of the equations 


may be called a reduced function of x, positively reduced if it satisfies (4a), negatively reduced if 
it satisties (4b). A positively reduced function obviously is an even function in the variable 
t=1- 2s, and a negatively reduced function is an odd function in the same variable. 


Putting p=1—g for ¢ in (3) we obtain 


\ et vib ) =v A f # -v r—-(r-v 
well—g)= = v—r+rq ‘() l—g)’q"-"=(—1)*3 r—v)—rq)*\ )v (l-q ( ), 


hence fe (Ll —g)=(—1)* p15 (Q) ........ 2000 iene uccataaonmanatenaed Ramee 3 


The even caoments are positively reduced functions of q and the odd moments negatively yeduced 
functions of q. That is: the even moments are even functions and the odd moments odd functions 
in the variable =p —q=1-— 2g. 


Now let us consider a linear relation between the (a+1) moments 


Psy Panty +++ Ms—as 


where the coefficients A;(qg) are functions of g. 


a ; A;(1-q) : : ? , , > 
The function ¢;(¢)=(- 1). 4 1) may be called the form-function of the equation. If the 
Ai(q) , ; 
form-function is independent of the subscript 7, that is to say if a function @ (g) exists so that 
: P ; A; (1 -¢ . . ; 
identically in g we have: (-1)'. (q) for all values 7=0, 1, ... a, then I call (6) a 
41j qd 


reduced equation. If no such function exists, I call (6) a not-reduced equation. 


The character of an equation as reduced or not-reduced evidently remains unaltered if the 
equation is multiplied by an arbitrary function of ¢. 


Biometrika, Vol. xv. p. 410, 1923. After finishing the present paper I received a letter from 
Professor Tchouproff who drew my attention to the fact that Romanovsky’s formula was first found by 
Bohimann. Cf. Bortkiewiez, Jahresbericht der deutschen Mathematikervereinigung, Bd, xxvu. (1918), 
S. 73. [Romanovsky’s formula was communicated by him in 1915 to the Society of Naturalists of the 
University of Warsaw. Ep.] 
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In the special case @(q)=+1 I call (6) a completely reduced equation, positively reduced if 


$ (Q)= +1, negatively reduced if @(qg)= —1. A reduced equation which is not completely reduced 
j is obviously transformed into a completely reduced equation when it is divided by any of its 

coefficients or any other function #'(qg) which satisfies one of the equations 

o(9q)-F(mj=tF (1-9), 
where @ (q) is the form-function of the given equation. 
Now suppose that we have given a not-reduced equation of the form (6). Since this equation 

holds good for every value of the variable between 0 and 1, it holds good for the value (1—g). So 

} we have ‘ 


A; (1—q) pi (1 —¢g) =0. 


Substituting the expression for p,—;(1—¢) from (5) we get 


—t 


a 
> Bg CL 9) gc GO a sactvs suncsesescncerecssncanceuncaeeen 7). 


i=0 
The iast equation evidently is a not-reduced equation. Furthermore the two equations (6) 
and (7) are linearly independent since no function @(q) exists, so that for all the values 
i=0, 1, ... a the coefficients (—1)'A;(1—gq) of the new equation (7) are @(qg) multiplied by the 
coefficients of the given equation (6). 
Thus we have the following fundamental property of the moment equations: Zo every not- 
yeduced linear equation between the moments corre sponds another not-reduced equation which is 
linearly independent of the first, and which may be written down immediately when the coefficients 
of the first equation are known. 
[t is easily seen that this property does not belong to reduced equations. From the two not- 
reduced equations (6) and (7) we may by linear combination deduce an infinity of new equations. re 
Of these however not more than two can be linearly independent. May it in this way be possible 
to obtain two linearly independent reduced equations ? 
Evidently it does not restrain generality if we suppose the equations to be obtained by adding 
to the equation (7) as it stands the equation (6) multiplied by some function @(q) of g. This 
gives 
a 


> [0 q) A (g)+(-1 Mish Ltd) Uiphignig Ooo eecncaieasscencuceoweeten 000(0). 


0 
i 


The form-function of this equation is 


6 (1—¢) ”q +1 


6(l—q) 
pi (q) +4 (q f; 


&; (q)= 
where @;(q) is the form-function of the given equation (6). 
The necessary and sufficient condition that there exists a function ® qd independent of the 
subscript 7, so that identically in ¢ 
P; q ~p q 
for all the values 7=0, 1, ... a, is that 
Og) 0 (1—G) 1.0... ccececccseccsseveccescceseevecees yaeneen 8 b), 
for the values of g in the interval 0 < ¢ < 1 for which the form-function of (6), @; (qg), is not inde- 
pendent of the subscript 7 For the values of g for which (8+) is satisfied, we have 
l 
@ (¢ J(1—<¢ 
b(q)=0(1-¢g 6(q 
Hence the necessary and sufficient condition that (8) is a reduced equation is that @(g) is a 
function with the specified properties. Furthermore it is evident that if we choose two such 
functions 6, (q) and 62(g) so that 6, (g) + 6 (q), the corresponding two equations of the form (8) 


are linearly independent. 
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So we have the second’ property of the linear equations between the mements: Any not- 


reduced equation of the form (6) may be replaced by two reduced linearly independent equations of 
the form 

a 

S (6; (g) 4g. (q) + (— 1)8 Ag (1 —@)] pres 2... -erocccevccescoees 9a), 

i=0 

a 

S [02 (q) As (g)-( —1)* Ag (1 —@)] pas=O «20. eeecreecereccersceees (9b), ‘ 

7 0 


where 0,(q) and 0,(q) are two arbitrary not identical functions of q satisfying the conditions 
specified under 8d). 


In treating linear equations between the moments one will nearly always find it easier to 
handle the reduced than the not-reduced forms. 


Application. The equation (2) is a not-reduced equation with coefficients 





s—1)\ ‘s—1\ : : 
A; q =4( c )-rgp |. 1)» U 2, 3, ove & 


Choosing 6; (7) = +1, 62(q)= —1 we may replace the given equation by the following two reduced 
equations which are written down immediately by the help of (9a) and (95), 


not v. [~ s—] /g—1\ : 
2s A [ q+ —1 ‘p ( ; )+ 1+ 1)! \ 4 )rqp | us po ae ee 10a), 
(8 Bas ¥ cs 8 
( 1 ) is = | q+ -] ‘P ( i+ 1) +(1+(— 1)! ( ; )vap | a Sicveveauseyerd 10d), 


where the summation is to be continued to the last not vanishing (“not v.”) term. The equation 
(105) is the equation obtained from (9b) when (s—1) is replaced by s. 


Writing out the formula (10a) we get 


a ed At ena eee 


ial), (a= IT/a-1y (a= 
+rqp \ ) Ms-2+\ 3 JOT + 5| | 2 js o+( { ) Matt. creme 11), 


where each of the sequences is to be continued to the last not vanishing term. 


The formula (11) may be slightly more convenient for numerical calculations than the formula 
(1). Other formulae involving still less calculation may however be obtained as I shall show in 
the next paragraph. 


$2. A general System of Recurrence Formulae. Let us denote by Hi, the left-hand side of 
Pearson’s equation 2), 


(3—1° sf /s—1 3s—1 
M=wta("y') mat 3, [95 ')= eae (574) Jae 


Calculating the first few differences of H/,, 
4H,=H,.,—H,, 
A?H,=AH,,,—A4H,, 


the following formula for A* H/, is suggested, 


wn "8 [v()e- (3) 


i=0 
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The formula may easily be proved by complete induction, It obviously holds good for £=0. 
Supposing it to hold good for * we have 
Aft! 7,= A¥ H,,, - O* H, 


= "BL e- Gar (Ga) Ge) ere 
2S [0 (Ee (a)ae (Ge eco GE) arene 
--"S [ev +G5))e-(@-Ga))d 
+L (24) (Ea) (Ga) +s) re ees 
Eo Ce ee (HME) oes 


which is the expression fer A**+! H, obtained from (12). 
From Pearson’s equation we know that H,=0 identically in s, consequently Af H,=0. 


Hence replacing (s+) by s in (12) we have the following ¢. tation, 


1ot V : k (s—k -1) (g—k—1 E ee a 
“e [ “a (7) e-( z )a+(( i—1 )+(-0) ad rap | byai=0 nn (184) 


where / denotes an arbitrary not negative integer. This is a not-reduced equation, which is 
easily seen by an inspection of the form-function of the equation*. Using the result of the pre- 
ceding paragraph we may immediately write down the following not-reduced equation, 


s[ ~1)i-1 3 . ’ - ') is (‘) q+ ( -1) (' ede ') bs (, k )) rap |p.—i=0.n(13D). 


Taking first the sum and then the difference of the last two equations, which corresponds to 
choosing the functions 6, (g) and 62 (q) from the preceding paragraph equal to +1 and —1 re- 
spectively, we obtain the following two reduced equations, 


“S[(((E2) +@) come-nea eon (G59) +(h)) no 
EL(Got)-(ha) oenrasccan (FA) ane tn 


where / again denotes an arbitrary not negative integer, not necessarily the same in (14q@) as in 
(144). The equation (145) is the equation obtained by subtracting (13@) from (130) and replacing 
(s—1) by s. 

To every not negative value of / correspond two recurrence formulae (14a) and (140). 
Putting £=0 we obtain the formulae (10@) and (10). In this special case the formulae contain 
all the moments from pop to p,. By choosing & conveniently we may reduce the number of 


. . a a “ef ° q $ . 
moments occurring in the formulae. In fact if we choose any particular value ky ( — ) of /, the 


sh ky — 1 (if (s— ky) be even) ee 
coefficients of all the moments from order zero to order (Ao % vanish in the 


\ ko (if (s—hy) be odd) J 

formula. If s be even and /=-, the coefficients in (14@) vanish for the moments from order zero 
k—1 (if $s be even) : ea 5 \ - 

to order a 9 ‘it be “tl odd) , . If s be odd we may for instance choose /=4$(s—1)+. Formula 


* The only case in which (13 a) might be a reduced equation is the case k=} (s—1 
+ k=4$(s—1) being the case in which (13 a) is a reduced equation, the special formula (15), p. 170, 
may of course also be derived directly from (13a) or from (13 b). 








170 Miscellanea 


(14a) then gives 


oral C)n+ Canes Jome[ Co lant] 
- [(:) ae: (‘) a i] eerie a les ee (15). 


This formula involves considerably less calculation than the analogous formula (11). For 
instance to calculate p43 by (11) we must use the values of all the moments from po to py. with 
numerical coefficients such as 495, 792, 924, ete. To calculate p43 by (15) we only use the moments 
from pz to py with numerical coefficients 6, 15, and 20. 

I think it should be possible by further transformation of Pearson’s equation, using the 
results from § 1, to obtain an equation only containing four consecutive moments, but I have not 
yet been able to carry out this transformation. 

S$ 3. ee neralization of Pearson’s and Romanovsky’s Formulae to incomplete Moments. The 
incomplete moment of order s may be defined as 


where a4 (’ VEO vesccinsovecssveecasecdoccassevaveveresstes (16). 
y 


lf it be desired to emphasize the lower limit of summation we may use the notation py,. 
Differentiating the equation (16) with respect to g we obtain 


s+1 


ene 5 si sf rs(v—7q s—1 T,\ ’ 
v=p dP J 
hence Pe+1 GP |7Sps—1 + ps | eeccccesescvecesecovcoveseocessoscosoes 17). 


This I think is the simplest proof of which Romanovsky’s formula is capable. It is a very 
interesting fact that the formula holds good without any alteration, even for the general case of 
incomplete moments. 


For the incomplete moment of the first order py= = (v—- rq) T, I have given the explicit 


expression in the Skandinavisk Aktuarietidsshrift*, 


Al (7 a / 
pi =ppT>=pp ) gp © ccccuvesunsevasweeees Sepewecsdasccosenst 18), 


) 


co} 


Introducing this expression in the equation obtained from (17) by putting s=0, we get 
, ) ‘lv 
po =" Ty. 


Integrating between 0 and g we obtain in the case 0<p, 


Ho=p (;) [ x0" PRE |  cocivarnnroerteeoesinsteapenens 19). 
PJ Jo 


This is the expression for the zero moment of the point binomial first found by Professor 
Pearsont. 7 and p being positive integers, the integral (19) nay of course be calculated directly, 
but the expression for yy obtained would not be any simpler than the definition 


Po 2 T, 4 
The formula (19) may still be useful in numerical applications as shown by Dr Camp§. Now 
No. 3, 1924, p. 161. + Biometrika, May 1924, Vol. xv1. p. 202, and note by Dr Camp, p. 171. 
The most interesting results of the direct integration of (19) are obtained by equating coefticients 


of equal powers of g. In this way some not unimportant identities satisfied by the binomial coefticients 
may be obtained. 


§ Biometrika, May 1924, Vol. xv1. p. 157. 


+ 
+ 
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assuming the numerical value of po to be calculated, I proceed to prove a recurrence formula by 
which the values of the higher incomplete moments may be calculated successively. 
Putting fo=( -—rqy-}, 
Iv = ( a rq) a 
we have, by partial summation of (16), 
A - - i 
Ps>= = fv9Jv=Sr = Jv— = A}, = Jie 
v=p v=p v=p i=p 
Introducing the following expressions taken from (18), 
4 ny 
= gv=ppT, 
v=p 
; 
> 9: =ppTp—(v+1) pT,.;, 
i=p 
we get 
rl 
ps=ppT pf,— pp Tp = Afy+ 


v=p 1 


r—1 r 
= P (v+ 1) ae 1Af,=pp Tofot4 > (7 - v) T, Af, 
=p v=p 


=ppTpfotq = (rg—v+rp) T.Afh=pp Tefotrgp = TAfv—q = (vg) TAf- 
v=p v=p v=p 


2 ; ‘g~] a. fe=h\., : s-1 
Introducing A}, = ( | ) (v — re \8 24-( 2 ) (v—-7rq lia ee va i) 
we finally obtain 
" 8-2 /3—] -2 /s—1 
Ps=ppT',(p—7rg)*"!+rgqp = ( ; \ wimg = ( ; Va Gh) So ncevecweesee’ (20). 


This is Pearson’s formula generalized to the case of incomplete moments. The formula only 
differs from the complete-moment formula (1) by the additional term pp 7, (p—7g)*-. Putting 
p=0 in (20) we get of course (1). The incomplete moments do not satisfy any simple relation 
such as (5), so the equation (20) cannot be replaced by such simple reduced equations as (14a) 
and (14d). 

The first few incomplete moments as calculated from (20) are 

= pp Tp, 
po=ppTp[p—(¢+1) q)+79P - pos 
ps=pp To[(p— (r+1) 9)? + 9p (27 — 1) +7 9p (P— 9) - Hos 





Review: The Elements of Vital Statistics in their bearing on Social 
and Public Health Problems. By Sir ARTHUR NEWSHOLME, K.C.B., M.D., 


F.R.C.P. New Edition, George Allen and Unwin, Ltd. 

THIS new edition of The Elements of Vital Statistics contains much fresh matter. Sir Arthur 
Newsholme discusses in an early chapter the different methods of estimating population. We 
think that he has some doubt as to the advantage of substituting “age in years and months” for 
“age last birthday” in the 1921 census. Personally we doubt whether it will conduce to 
accuracy. If anyone knows his age “age last birthday” seems to us the simplest form in which 
to state that age and one wonders how recorders deal with a portion of a month when they have 
to enter “age in years and months.” In considering “age at marriage” Sir Arthur Newsholme 
shows that between 1896 and 1920 there has been very little postponement of marriage, *88 of 
a year for bachelor bridegrooms and °40 of a year for spinster brides, and one concludes that 
such a slight postponement can have little to do with the falling birth-rate. 

The birth-rate is considered in Chapters VII, VIII, and [X. In connection with the regis- 
tration of births we should like to point out that though the maiden surname of the mother is 
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required in Britain it is not always required when the birth of a British subject born abroad 
is registered and we think steps should be taken to remedy this defect in birth registration out 
of Britain where it occurs. We cannot see that there need be any connection between the sex 
ratios at birth and wholesale prices (p. 88) since both are varying with time. Later in the book 
there is a warning on this point but it is omitted in this instance. 

As eugenists Sir Arthur Newsholme will not expect us to agree with much of what he writes 
on pp. 109, 110, and 111, though we fancy we see some “signs of grace” scattered here and there 
throughout the book and even on the pages referred to above! In the discussion on Infant 
Mortality it is assumed that a lower infant death-rate means improved health in the survivors ; 
we still dare to think that this is the expression of a pious hope which is at present unproven. 
The treatment of life-tables is very superficial *. 

We wish that heed would be taken of the chapter on the Registration of Sickness; we adinit 
the difficulties but they should not be insuperable and the same seems to us to apply to the 
problem of death certificates though we think there will always be a difficulty over international 
statistics, even if we in England adopted the Swiss system. We notice on p. 290 that “Nothing 
is more certain than the statement that the removal of poverty would effect an enormous 
reduction of disease”; and in the next paragraph but one a quotation from the Report of the 
Royal Commission on the Poor Laws (1909) which states that “Sickness is therefore admittedly 
one of the chief causes of pauperism”; these two quotations seem to us in opposition and both 
require investigation before anyone can say which is the truer statement. Later on when it 
comes to the question of infant mortality Sir Arthur Newsholme points out the difficulty of 
disentangling cause and effect when poverty and sickness occur side by side; but though he has 
pointed this out he does not hesitate to state that “Environment has a preponderant influence in 
determining the magnitude of infantile and child mortality in a community”; by environment 
is meant, we presume, poverty, housing conditions, &c. Material collected by medical officers of 
health in individual districts seems to show that the most important factors are the health of the 
parents and the care that they bestow on their children, and though we consider it not yet 
proven, we are inclined to think that the fall in the infant death-rate of late years is due to the 
Midwives’ Act and possibly, though hitherto there is no statistical proof, to some improvement 
in the habits of the mothers. 

Again we cannot see that there need be any connection between prices and phthisis or that 
it is necessary to try to explain differences between England and Ireland in this connection. 
Sir Arthur Newsholme does not assume that there is a close connection between prices and 
tuberculosis but we think that to give coefficients of correlation when there is a time factor 
introduced is very dangerous unless they are by one method or another corrected for time. It 
would be interesting to know whether the number of people attacked by phthisis had diminished 
or not but such information would be, we think, impossible to obtain, since tuberculosis is now 
diagnosed at an earlier stage than formerly. The old question of the influence of heredity in the 
development of phthisis comes up again ; some people among whom we count ourselves stil! 
believe in an hereditary predisposition but if one assumes that the association between tubercu- 
losis in parent and child is due to large doses of infection at an early age rather than to an 
hereditary predisposition the argument against the phthisical having children seems to us 
equally strong; the children will be more likely to have tuberculosis than the children of parents 
without the disease. 


In a review in Biometrika the statements in connection with the Variate Difference Method 
on p. 524 cannot be ignored. Sir Arthur Newsholme has quoted from a paper by Prof. Person of 
Harvard in which Prof. Person says that “the defective character of the variate difference method, 
as originally proposed, is admitted.” We think that Sir Arthur Newsholme has not read the paper 

* The result on p. 238 for p, has been obtained from English Life Tables, 1914, p. 60, without 


Sir Arthur Newsholme troubling to test his own arithmetic. Had he done so, he would, perhaps, 
better have understood the meaning of the ordinary life-table symbols. 
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in Biometrika to which Prof, Person refers. There was an error in the application of the method 
to the problem of infant death-rate which the authors of that paper admitted but that is a very 
different matter from an error in the method itself. The product-moment method of obtaining 
a correlation coefficient is perfectly sound but it may be applied without taking into account 
the time factor or that regression is non-linear, and erroneous conclusions may be drawn from 
the coefticient of correlation found ; such applications do not invalidate the method, end it has 
yet to be proved that the variate difference method is fundamentally at fault. We cannot help 
wishing that Sir Arthur Newsholme had ended his book at p. 500. There are many useful notes 
and “cautions” in the last 100 pages but also very much that seems to us open to criticism. We 
think the root of the difference between Sir Arthur Newsholme and the biometricians is to be 
found on p. 525 where it is suggested that “when the conclusions drawn from coefficients of 
correlation run counter to inferences on the same subject based on biological (including medical) 
considerations” judgment should be suspended. This seems to the statistician a most dangerous 
course to pursue, That a careful investigation of the results obtained by statistical methods is 
necessary we should all agree, but when this has been done we feel that the results must be 
accepted and that to accept only those results which agree with existing prejudgments is a 
retrograde step which must lead to disaster. We think it is this underlying idea which leads 
Sir Arthur Newsholme to reject evidence provided in the case of two collections of material and 
accept results deduced from another far more scanty and incomplete. On p. 558 the following 
sentence occurs: “ Another example, on a somewhat scanty scale, may be quoted from McDougall’s 
book. Miss Aylitt tested 342 children from the primary grades of schools in one district,” but 
there is no reference to scantiness of material when the intelligence of 372 girls in one school 
and of 247 boys in another are tested and far-reaching conclusions are drawn by Dr Isserlis from 
this material. Sir Arthur Newsholme assumes that the teachers in the two schools referred to 
were really testing intelligence and not knowledge and one infers that he thinks that in all 
previous investigations it was the other way round! The correlations found with age make one 
doubtful and the comparatively low association between the teacher's estimate of intelligence and 
intelligence judged by a test makes one still more doubtful, but the numbers, reduced in these 
tables to 161 girls and 106 boys, are quite inadequate. We fear that it is because the conclusions 
reached do run counter to previous inferences on the subject that they are accepted uncritically. 
This book gives an excellent summary of what material is available in the census returns and 
shows how the statistics in this country could be improved in many ways, as for instance by the 
addition of the age of the two parents when the birth of an infant is registered, but we feel 
obliged to object to many of the conclusions, considering that some are contrary to existing 
evidence and that others are quite unproven. The book is bound to be replaced at no distant 
date, by one on sounder lines. E. M. E. 


Lebenserwartung und mittleres Alter der Lebenden. 
Von E. I. GUMBEL, Heidelberg. 

Im folgenden sollen die Gréssenbeziehungen zwischen der zu einem bestimmten Alter gehéri- 
gen Lebenserwartung und dem mittleren Alter der nach der Sterbetafel iiber diesem Alter 
Lebenden untersucht werden. Fiir das Alter Null ist bereits in einer friiheren Arbeit* gezeigt, 
dass fiir alle heutigen Sterbetafeln die Lebenserwartung eines Neugeborenen grésser ist, als das 
mittlere Alter aller Lebenden. Ein Ansatz zur Erweiterung dieses Satzes findet sich in einer 
Arbeit von Pearson +t, der gezeigt hat, dass bei der Gompertz-Makehamschen Absterbeordnung 
fiir jedes Alter die Summe Alter plus Lebenserwartung grésser ist als das hierzu gehérige mitt- 
lere Alter der Lebenden. Es soll nun untersucht werden, unter welchen allgemeinen Bedin- 
gungen dieser Satz gilt. Ferner soll auch eine untere Grenze fiir das mittlere Alter der Lebenden 
aufgestellt werden. 

“ Uber eine Eigenschaft der Sterbetafel,” Zeitschrift fiir die gesamte Versicherungswissenschaft, Bd. 
xxiv. H. 4, 1924. 
+ Biometrika, Vol. xv1. p. 297, 1924. 








$ 
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Man bezeichnet mit (a) die Wahrscheinlichkeit, die einem Neugeborenen zukommt, den 
wten Geburtstag zu erreichen, sodass /(0)=1. Man definiere das héchste erreichbare Alter o—1 
durch die zweite Grenzbedingung /(#)=0. Ferner definiere man die Sterbensintensitiit p (x) wie 
iiblich, als 
r., (x) 


spe =< ° 
aa U(x) da 
Das mittlere Alter fiir die nach der Sterbetafel iiber dem Alter 2 Lebenden lautet 


| “y L(y) dy 


M(h) === 


. 1 (y) dy 


Die Integrationsvariable wird hier stets mit y bezeichnet, um Verwechselungen mit der 
unteren Grenze w zu vermeiden, 


Die Lebenserwartung wird wie iiblich definiert als 


7/..\ 1 [ 
E (x)= aa | L(y) dy. 


Man bezeichne die Summe Alter + Lebenserwartung als Alterserwartung. 
Somit gilt fiir die Gompertz-Makehamsche Absterbeordnung nach Pearson 
a (a)<a+ E(x). 
Dieser Satz liisst sich nach einem von Herrn v. Bortkiewicz* stammenden Verfahren fiir alle 
Absterbeorduungen beweisen, fiir welche die Sterbensintensitit stets mit dem Alter wiichst. 


Zum Beweise sei zuniichst eine einfache Beziehung abgeleitet, die im folgenden von néten ist. 
Man betrachte, nach der Definition der Sterbensintensitiit, 


| . yer) l (y) dy = - m a adl(y) 


=r xv)+m “ym 1] Y) dy. 
Im Spezialfall 


m=O: wird ™ w(y) l (y) dy=l (x) 


n= 1: wird [yp y l yj dy=al @v)+ a y dy=l a ) (a+ E X)), 


“ow 
2 | yl \Y) dy 


m=2: wird | PP ely)l(y) dy=a2?l(2)+— : 





- l y) dy=l 2X) (227 +24 wv) E 
J L(y) dy 7" 


(zx 


Unsere Aufgabe ist nun, Alterserwartung und mittleres Alter zu vergleichen. Man bezeichne 


av x)}, da / x >0, 


[ 
[ 
=alen | a Y—X(x)) ply) l yay | 
LJ 2 


nach den eben fiir m=0 und m=1 abgeleiteten Formeln. 


aa 
o=sign | p Y-U(e U(y)dy+ | 


Ferner ist: 


y—v(x)) ply) ly ay | 
via) “il 


L. v. Bortkiewicz, Die mittlere Lebensdauer, Jena, G. Fischer, 1893, p. 77. 
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nach dem Mittelwertsatz der Integralrechnun 


g, wobei «<< < x (x). 
des mittleren Alters der Lebenden ist 


Aber nach der Definition 


v(x) Pw 
| (y— =x (w)) L(y) dy= 7 (y—=x (x)) L(y) dy, 
: (a) 


sodass o=sign || (y—u(%)) (u(y) -p(p)) L(y ay | 
J x (x) - 


=sign pe (.v 
da wegen der Integrationsgrenzen @ <.w (wv) < y und da 1 () stets positiv. 


Also gilt: Die Alterserwartung ist grisser, gleich oder kleiner als das mittlere Alter der 
Lebenden, wenn die Sterbensintensitat fiir alle Alter wiichst, gleich bleibt oder fallt. Dieser Satz liisst 
sich aber nicht umkehren. Es kann sehr wohl die Alterserwartung grésser sein als das mittlere 
Alter der Lebenden, ohne dass die Sterbensintensitiit stets wiichst. 


Der Pearsonsche Satz ergibt sich als Spezialfall. Denn die Gompertz-Makehamsche Formel 
geht aus von 


Bw A+ Be*, 
woraus p’ (v7) = Be* log ec <. 0, 
weil B>O, 
und c>l1. 


Somit haben wir eine unter einer Bedingung giiltige obere Grenze fiir das mittlere Alter der 
Lebenden aufgestellt. Nach cinem allgemein fiir solche Abschiitzungen verwandten Verfahren ist 
es leicht, auch eine untere Grenze hierfiir aufzustellen. 


Man bilde die Wahrscheinlichkeit dafiir, 
dass ein Neugeborener den «ten Geburtstag nicht erreicht, 


Je l —l VY), 
dann gilt fiir die Momente dieser Funktion 


q w w 


y" dg y)=— | yr Aly). 
2 Bo 1 fe ; 
Man bilde nun den stets positiven Ausdruck A al | aj tay)*dg(y), wobei a; und ay 
1 xr 
vorliiufig willkiirliche reelle Konstanten sind. Fiihrt man die Integration aus, so wird, wenn man 
die oben berechneten Momente verwendet, 
A =a," + 2a, ay (“0+ E (x)) + ay? (a2 +24 (x) B(x > 0. 
Nimmt man die quadratische Ergiinzung vor, so wird 


(a; +a, (u+E (x))? > a? (w+ E (x))?— ay? (a? 424 (“) E(w 


Denkt man sich jedes einzelne Alter speziell betrachtet und setzt man hierfiir jeweils 


ay = — ay (&+ Bia 


’ 


so wird w+2a(r) EF 


v) > 22 +202 E (x)+ E(x), da ag? > 0, 
E wv) 
oder C(t) >r+-9 ° 


Demnach gilt der Satz: Das mittlere Alter der nach der Sterbetafel iiber dem Alter « Lebenden 
ist grosser als das Alter cede r halben Lebenserwartung. Fiir das Alter null gilt speziell : das 
mittlere Alter aller nach der Sterbetafel Lebenden ist grésser als die halbe Lebenserwartung 


eines Neugeborenen. Somit ist es gelungen, das mittlere Alter der Lebenden mit Hilfe der 
Lebenserwartung in feste Grenzen einzuschliessen. 











FURTHER CONTRIBUTIONS TO THE THEORY 
OF SMALL SAMPLES*. 


By KARL PEARSON, F-.R.S. 


I. On the Correlation of the two Standard Deviations and of either of these with 
the Coefficient of Correlation in small Samples taken from a bi-variate Popu- 
lation following the Normal Law of Distribution. 


THE distribution of the two standard deviations and the coefficient of corre- 


lation, namely o,, o, and r,in samples of n taken from a normal population of 
constants =,, =, and p is known to bet: 


1 ‘no,2 2urpoyos Noy” 


z= “He a-\EF” 22, a) (219 co ey tee (i). 


~—l—2 


Let us write 


eer + y/ ai = 
s,=>,V1 — p*/Vn, 8% =2,V1—p*/Vn, npo,o./S,¥.(1 - P*) = PO 02/8, 8 ; 


then 
, 71" ee rere n-4 
“ a 292. 2 ed (%22\ i te a 
S=2, ¢ leé 2" | "allt ey a! cc eR (11), 


\ 8,8 J 
where z, differs from z by terms not containing o,, o, or r, and Y = po ,0,/8, 52. 
Accordingly the frequency surface for ¢,, o, will be obtained by integrating 


throughout its entire range, Le. from —1 to +1. We require accordingly the 
integral 


Take 7 = sin 0, and we have: 
[= | = eY Sin 8 aggn—3 6dé. 
T 


2 


Expand evs? and note that all terms 
TT 
+3, 
| sin?+! @ cos” dé = 0. 
r 
Accordingly 
r2 y. ff . 
[ =?2 (1+ 5, sin?6+ 41 sint @ + -) cos” 6 dé, 
JO \ “a: } 
Printed from Lecture Notes. 
+ R. A. Fisher, Biometrika, Vol. x. p. 510, with a different notation. 
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Put a =sin?@ and we have 
ea n—-4 y? n— 4 , n—4 
= a ase iy Sane ‘aw 
| I=] (« 7(1—~2) + 5,71 — x) + 7% (1 — 2) Mena 
2p-1 v--4 
2p 
+i! “ (l-«z) +...) de 
_pfl n-2 y,/3 n-2 yp/d n—-2 
=B\ar 9 )+53(3. 2 ) 4) (3 “F) 
y? ,(2p+1 n—-2 
+o B( >> >)+ ietone (iv), 
where B(p, q) is the usual beta-function, 
1 n—2) / yf ' 3.1 
=B(5,"5-)(1+2 y aa ae 
- § /\"* Re-i* Fie hen 
y? (2p — 1) (2p — 3)...1 + ) 
2p! (n—1)(n+1)...(n+2p—8) °°", 
fl n—2\ 5 I ] 
= B (1 at Ts 
laa JU +yy In —2 + 21°(2n — 2) (In + 2) 


= 


: ad I ) 
+2. 7 ee ae —~ +... }. 
2!" (2n — 2) (2n + 2)(2n + 6) p! (2n — 2) (2n + 2)... (2n + 4p — 6) 
We can now return to (ii) and write down our equation to the frequency 
surface for o,, o,. It is 


= a," , 92" = e ee 
; 252 tae (%102\"" ‘148 ao; 1 
= Ye é : 
Y=Y ae \ 1! 5,782 2n—2 
p* a,'o.4 I 
TO pies (9D 9)(2 gy t ++ 
2: 8&8" (Zn — 2)(2n + 2) 
2p oP oP 8s 2g 2p 
Pe a argo oe ae = ) ales Je le (v) 
p! (2n — 2)(2n +2)...(2n +4p — 6) 


It is clear that in order to find any product moment function of this surface 
about the zero of o,, «, we need only to find the value of 


” e 
aD 1 oy \o 
- s (o,\? (ao \m 
I’=| e ( ‘| ‘| do, 
Jo a, 8, 
po a," ° 
=, x "e “#8 oo fet 
ae 8, / S, 
— g;* o = 
Write z=}$— or “= ¥2z, 
= Site S$; r 
Hence IJ’ is given by 
a 2p+m-1 
Pr ai 224 +m—1 gr e~? zZ 2 dz 
~ 0 
2p+m—1 
2p+m 
ss 9 
3 2p+m+1 : 
ene r( I et ME (v1). 
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Further Contributions to the Theory of Small Samples 


If N be the total number of samples, we have : 


v=[ [ ydo,do, 


6 Jo 
sins \,, (/n— 1)\ 
= y,2"-* (8,82) x ‘y . 5) ‘) + 


» (N+3 1 
+ pore (248) 


9 \ 
an ae Ee 2p+m+1 
+ pv 2” I (25 


= You" —* (5) 52) x lr ("5 : ) (1 “ 


2!(2n — 2)(2n + 2) 


(n+1 


abt te 


ces 


2 ) pi@n—2) Gn +)..Qnt4p—6) * 
n—1lp? n—1n+1 p 
2 1! 3 7 


n—1ln+1 —1+ 2p p? 
paging] s-letper, ), 


=. 


Or N=Y 


“ p: 


» (/rn—ly 
Qn-3 (S$, 8.) X hy ( 2 ) 





= 


(1—p?) * 


This determines yo, i.e. 


N (1 — p’) . 





We shall now find 


*D ra 
Np',0.= | yo,0,do,do, 
“0 0 
a ee ee 
= 5,18, | | gS tgts 
0 J0 S,; So S$) S 


p* * n+4 l 
+5" e ) nD 
a p : on» = =P) 

p: yA 


nv é 
= $78.2 Qnr-2T2 (1 - E 
Fo% es (5 4(n—1). 


kn (4n + 1) dn (4n +1) 

t(n—1)$(n4+1).2!? 
dn(4n+1)...4 
5 ( 


3 ~2pe ( 
= Yo8,78.2 2” T? (5) F(4n, 4n, 


+ 2) 


(2n — 2) (2n + 2)...(2n 7-8} 


(n+p)$n(3n+1)...$(n+p) ,, 
4(n—-1)4(n4+1)...$ (n+ 2p—3). p! tli 


(n—1), p’), 
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where F(a, 8, ¥, p*) is the usual notation for the hypergeometrical series. But 
by Euler’s Theorem : 
F (gn, 4n, $(n —1), p?) = (1 — p*)- 2 F(A, 4, $(n —1), p®). 
Accordingly we have : 
, aaéasead ae . 
Np 0409 = YoSy Sy Qn-2 [2 (3) (1- p”) a+) x (p*), 
/ ? ] a l 
om a 2 f 

where H(')=\1 + Tig 2 + 91 Ga —2)Qnad)* 
e ! 


+p! Qn —2) (Qn +2)... Qn+4p—6)t ) ——r 
Accordingly 


Returning to Equation (v) I now proceed to integrate it for one value o, of the 
two variates only. We have 


eD 1 ‘ 
; - 3 = 2 om t— 
¥do,=ye *1 ( ) <e 
I, J S$, / 
i D = (G\""*% p® o,"aj? 1 ce 
x} ¢ s(—-) 2>—— "7S = 5 » a (2) 
Jo \Sp / 0 Pp: SPs? (2n— Z)(2n4+ 2Z)...(2n+ 4p —§) 82 
1 di nm—2 x 2p fon } 
=y8e  *1 ( ' xz? f ) 
ali KS, oP!\s 
2p+n--3 
= i. 
A 2 2 r { . 4. ) 
a P) (n=) Gn + 9)..Qn + 4p —6) 


n-3 g, \* 
9 « pe — L\ -4(2) (ar\"" fy ae p* (<) n—1 
&,/ 2(n-1) 


+£ (2) (rn —1)(n—2) ) 
2!\s,/ 2?(n—1)(n—- 2) | 
Oy" 2% 
c 2 7 n—1 (C7; willing —4 <3 2P 3.2 


n-3 n--2 
1 no~ 


9 > 
YoSo2 an e * 2 [ an T (n— l 

y= ; < \ 

y n-2 \S ee ae 
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or finally substituting for y, from (viii) : 


n-1 . \ 
a no," 
Nn -4 pe (% a3 ; 
y = —_.——_—_—_———-e c 
y ea é \s Roldnawticenasatmaieaen (X1). 
= r As s -“) 
5* ( 2 ) ™ 


This can be checked by integrating with regard to o, and gives at once 
D 
| ydo,=N, as it should. It is of course the usual form for the distribution of 
0 
standard deviations in samples of n from a normal univariate population*. We 
now proceed to find 
® cD 
| yo, do, and | yodo, 
0 0 


or as we may write them : 
Nop. and No, pw’. 


pe -4 no," ‘oO u—1 a, 
i T haa as , 2 (21 ~ 
We have Nam=y3t | ¢ * () d (3). 
where y, is written for brevity for 
n-1 n 
ac a = =i 1\ - 
N n “ I = —) 
2) 
n—2 
S29 2 \ 
‘ Y ace tac y [ie 
or substituting for y, 
n 
r 
IE 2) . 
=S ee ree EETTEETEETTE TTT TTT Tee xii 
oi, fi oe np (e—1) 
—a 
, , Noy ae 
ax ail { \ 
= r _ x a 2 32 07 C7 \ 
Again No, M2 = Yor |. e 1 (s ) a(S ) 
7 on <1 
n—1 
Yor? © petty 
a n-1 z ) : 
2 
n 
or, on substituting for y,, we find the familiar result 
P s, n—-I ee 
ie TE a ncsencnincdnaveeinnsiauebandan (xi1l). 
n ’ 
pd ,(n ,(n—1] 
Hence writing A, ='(5)/1 >)» we have 
2, 2 
9 21" 9 . 
0, fy = Og, = . a scuba ckscaseeenciete (xiv). 


+ 


Biometrika, Vol. v1. p. 304 and Vol. x. p. 522. 
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Accordingly 


and as the correlation coefficient : 
‘ : 
m _ P a0, — “a, “oe, 
? O1% ar e od , 
Te,Fo, 
we find 
2x2 (H(p*) — 1) sai 
— ia XVi). 
We shall now proceed to investigate the value of the factor 2A,2/(n — 1 — 2d,°) 
when n—1 is large. We shall have to be cautious because if 2),? starts with the 
term (n— 1) everything will depend on the second term in the expansion. We will 
therefore expand 2, in terms of e= }(n —1). 


A good iormula for ['() is that provided by Schlémilch*, namely : 


11 1 ] we 
yr $f) = 1 o$ ZL— 1 Y¥@LZ— @ —— —_—__-——— ATC. Xv q 
log T' (a) = $ log 2a + ( 3) log a + 5957 360 pa D@sa te (xvi) 
Now log An = log P (5) —log P (“ = =) 


—] , —] 
= log l (“> +4)—log P(" 5 ) 


a log (e+ 5 )- log te); 
and this equals, by the above formula: 


1 
elog(e+})—e—3+ 


1 
3 wel’ 


—(e—})loge+e— = - + ete. 


rE 


1 ee 
or log An = $ log € + € log (1 +; ) —}+ terms in - ete. 
é 


pitied oe ee 
=sloge+}—- g. — 2 + terms in | ete. 
O€ e& 


1 
ae 1 
, Hence AX, = Vee %* a We (1 —- ) + ete. 
de 


Ve 


to our degree of approximation. 


* Compendium der hiheren Analysis, Bd. 1. 8. 264. 
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Hence 


and 


2(n—1)+ ete. 


a=. € 
Accordingly when n is large : 
Ye,0, approaches (2n — 2)(H (p*) — 1) 
= p’+ terms in inverse powers of 2n + 2, 2n + 4, ete. 

This approaches the value p? given long ago* for large samples from material 
with linear regression and homoscedasticity. 

In the ordinary theory of probable errors it is usually the mean value of the 
product pss, that is involved, i.e. 04,04, 1o,0,- But o,, oc, are known and have 
been tabled for small samples. It remains to table r,,,, and see how far it diverges 
from the value usually assumed (i.e. p*) for it. 

The following table gives the values of lq,0, in samples of size 2, 3, ... 25, 50, 100, 
400 for correlations p in the sampled population of ‘0, ‘1, *2,... °8, *9. 


TABLE I. Values of the Correlation of Standard Deviations in Samples for Different 
Values of Variate Correlation in a Sampled Normal Population. 


Values of p. 














‘0 *] 4 3 "4 5 6 a ‘8 9 1°0 

2 ‘0000 =*0088 0352 0794 | *1420 2236 *3249 ‘4470 “5912 *7587 "9514 

3 ‘0000 0092 ‘0367 ‘O828 | +1484 +2323 3369 "4622 “6094 °7795-— | °9737 

y ‘0000 0094 ‘O3B75* ‘0846 > +1509 2369 *3430 “4699 ‘6184 *7893 "9837 
5 0000 *0095 ‘0380 ‘0857 | °1528 +2396 3467 ‘4749 *6235* , °7948 *9890 
6 ‘0000 -0096 ‘0383 0864  *1540 *2415 3491 ‘4775 +| .6268 °7981 "9921 
7 ‘0000 =*0096 “0386 ‘0870 | *1549 *2427 3508 "4794 6291 “8004 “9940 
8 ‘0000 = =*O097 ‘0388 0873 | *1556 "2437 3520 *4808 *6307 “8020 “9953 
9 “0000 = *O0097 ‘0389 ‘O876 | ‘1561 "2444 5 "4820 ‘6319 *8031 “9962 
10 “0000 = ‘0097 ‘0390 ‘O879 | °1565 "2450 3537 *4829 6328 “8040 “9969 
11 ‘0000 “0098 “0391 ‘O881 "1568 *2455 3543 *4836 6336 “8047 ‘9975 
12 “0000 =‘0098 ‘0392 ‘0882 | *1571 "2459 3548 “4842 *6342 *8053 ‘9978 
13 ‘0000 30098 “0392 0884 | ‘1573 "2462 "3552 "4847 *6347 *8058 "9982 
14 ‘0000 =-0098 0393 "0885 | 1575+ 2465 3556 "4851 6351 “8061 “9984 
15 ‘0000 *0098 ‘0393 *O886 | *1577 2467 3559 "4854 63557 | -8065- | +9986 
16 ‘0000 . ‘0098 0394 ‘O887 | ‘1578 2469 3562 "4857 "6358 ‘8067 “9988 
17 ‘0000 = “0099 0394 ‘O888 | +1580 2471 3564 -4860 ‘6361 *8070 “9989 
18 “0000 = -0099 "0395 ‘O888 | *1581 2473 3566 "4862 ‘6366 "8072 “9990 
19 “0000  -0099 0395 ‘0889 | 1582 2474 3568 “4864 *6363 "8074 “9991 
20) ‘0000  -0099 *0395° ‘0890 | ‘1583 2AT6 3570 ‘4866 *6367 *8075* | 9992 
21 ‘0000 | -0099 “0395 ‘O890 | *1584 2477 °3571 *4868 *6369 ‘8077 *9993 
22 ‘0000 | -0099 “0396 “0891 *1584 2478 °3573 ‘4870 ‘6371 *8078 “9993 
23 ‘0000 . ‘0099 ‘0396 ‘O891 °1585 * 2479 3574 ‘4871 *6372 “8079 "9994 
2h ‘0000 §=“0099 ‘0396 ‘O891 *1586 2480 3575 ~| °4872 6373 *8080 “9994 
25 ‘0000 | *0099 ‘0396 ‘0892 | ‘1586 2481 3576 *4873 ‘6375- | *8081 “9995 
50 “0000 = *O100 “0398 ‘0896 | +1593 2490 3588 ‘4887 ‘6388 “8091 *9999 
100 “0000 =*O100 “0399 ‘0898 | °1597 2495 3594 “4894 "6394 “8095 * | -99997 
400 ‘0000 | 0100 “0400 0900 | *1599 1499 3599 “4898 6399 *8099 = 100000 
p? 0000-0100 0400 | *0900 | +1600 2500 | *3600 | -4900 | *6400  °*8100 | 1°0000 


Biometrika, Vol. 1x. p. 4. See also Phil. Trans. Vol. 191, A (1898), p. 24 
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Il. On the Correlation of Deviations in the Standard Deviation of a Variate and 
the Correlation Coefficient of Variates in Samples from a Normal Population. 


We start from the general surface for the three variates, i.e. : 

. faa? , (2\" n-4 ; . 

, a 3 8 ) . s (..) yr ° 9 a} fie ‘Ox cdi 

Z=%e : e - ey" (1—77) ~ ) ( ‘ 

a; ee 

and integrate it for the variate o, between the limits 0 and o«. We shall thus 

reach the bivariate frequency surface for ¢, and r. In order to do this we must 
expand the exponential e’” and find: 


/o,\?2 ‘o\2 
0 ae fet. \ n—2 n-4 po —3 (3 — p \P 
0 8), J0 So / 0 p! 8) 82 
Remembering that the integral 
~\* , n-3+p 
eS wt > n—2+p a 
| e (2) (2) do.=s,.2 2 LP" st), 
- ) 
0 2 \ = 
we can write for the frequency surface of o, and 1 
fe,\? n-4 n-3 
, / -4 (7) et aie ne 2 ~~ 
S = 2,80 *] (=) C= 3). oF k \ 5) ) 
om \2 2 =— ‘ny / \4 
ey” g n—1 (rp) 1) +o" bedi | 
{ 2 2! 8 2 2 4! \ 3) ) 
n (rpy o,\° , a (*) + 1 = 
/2 ) 92 ame 4 
+0 (5) rp = i! +25 31 *) +2 s(5 + 1) ; ) + of XViil) 


The vciiaiies of this surface for r and a, is at once obvious, and it is not easy 
to form a mental picture of the form of the surface for small values of n. On the 
other hand we shall find it relatively easy to evaluate the integrals 


Tw -[" a sdrde,, f= [. . [ erardey, os -{" |, eo.drde, 
la=[ (" ¢r*drdo,, and I, ={' | “gra,drde,. 
-10 d 1/0 


The first four will check results already known, and the last will provide us 
with what we are at present seeking. When we integrate with regard to 1, all 
the terms in odd powers of 7 disappear and we may take for the even powers the 
double integral from 0 to +1. Remembering that 


n—4 
n—2 


el 9 

H eae , (2p +1 2 
9 —_ 4 - 2p _—_ B i 
2] (1 —r*) rP dy ( = 5 ) 


2 = /2 wf 
I (" Y, “pts 
sii’ 2 + /F( 








184 Further Contributions to the Theory of Small Samples 


we easily find 


o,\2 n-3 
+} = ( ) a—2 —— > ha 9 
es 3 | - o s p—n/n—2 
[ 2dr = %'s,e ‘1 ( ' 2° Vel | = ) 


/-1l1 \ 8) \ 
( p? (a,\" 1 p* =)" 1 ) 
° +E (Sa #\s) ait | 
3)" 2 ’ = 1 
= Sq Sal i/ @ “2 Var l \ 9 (2) 
ate n—3 
-—43(1-p*) ——. A n—2 7 a 2 ; 
= Sq 8,6 at (&2) Vr2 77 (" : ) acaneveateecaes (xix). 
S, 2 


If we integrate next with regard to o, we have to find 


at 
cee ere 
e a | ) do, , 
0 7) 
which equals 
1 n—1 
n-3 v a 
n il ( ? ) 
3,2 Ps 
tk—p') * 
Accordingly we have 
-1 
; 3 , _. n{n— 2 ,, frn- DL a? 
N = 2588. V 7 2” tt jl ( 2 )/a-e) e  eowemeed (xx), 


which checks with earlier results, 

+1 

We will now investigate | z‘rdr, which throws us onto our second series in 
=} 


(xviii) for z’: 





a7 » 2 ~ 4 
a) x l\s am "+ *) 
2 
\ n p” o,\? n(n +2) p* o,\4 ) : 
)y ( ™ 
ars oir (3) (n+1)(n +8) 2! es) + 


If we divide this result by that in (xx) we obtain the mean value of r for 


a given array of r’s corresponding to a,, i.e. 


p r (5) o, ~3" (2) f “i p" A) 


7_—_" p(s 8; ” ( n+11!2 5, / 
2 
n(n + 2) p’ (a,\ ) ae 
n+ l(a +3) 212 (=) bali Se aT Dae (xx). 


The regression line is therefore non-linear. Let us see what form it takes as 


we increase n. Since s, = =, V1 —p*/Vn, we must approach the limit with caution 
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for s, becomes zero as n increases. Let us write «= }p*o,°/((1 — p”) =,°) which will 
be finite. Then we have to find the limit of 


flAnT 3) 
Vs 
3 2" (5 o; ‘ ( n NK n(n+2) 2x? 
‘9° SS : — em 1+— -+ h : “ay 
P(g(n+1)) 3, V1-p? \ n+11!° (n+1)(n+8) 2! 
n(n+2)(n+4)  n*x? ) 
(n+1)(n+3)(n+5) 3! Sir 


But Vin P (4n)/0 (3 (n +1)) takes the limit unity. 


Writing n« =x’, we have now to find the limit of: 


U 


e n kK n(n+2) KK? n(n+2)(n+4) x’? 
L=<«~ — — +7 pes 
? (1+ 097 Tit (mt Dnt 8) 21* GF] (n43)(n4 531" ) 
Expanding the exponential and multiplying out we have : 
woe 1 A 1.3 K” 1.3.0 es 
“2s nm+11! (n4+1)(n4+3)2!) (n+1)(n4+3)(n4+5)3! °°" 
n «K nW1.3 K? ake Se a 
Cee — po oe 5 > See ie ae ee 
mth! (n+1)(n4+3)2!) (vn +1)(n4+8)(n4+5) 3! 
Hence proceeding to the limit we find: 
e.t.2. 424.8 





L=1-;,+ a7 * 3) K+... 
~—4 / P co; 4 
=(1+ 2x) t=(14 —? — :) ; 
¥ \ 1 —p’ >? 
Or as n grows large we have nearly and more nearly : 
= o 1 sere 
F., =p = AT ET (xxii bis). 
~ a ef F%1\ \% 
rr) 
—) 
Thus the equation suggested by normal correlation : 
= e Pp Or, = 9, 01 — GO} 
T.,- TF =-—= —(0,-—G,)=p.(1-p* = 
% N 2 . 1 1) Pp ( P ) - a 


i.e. the usual value for linear regression, is seen to be incorrect. The true regression 
is not linear, even for n indefinitely large. When p=1, r.,=1, from (xxii bis), 
as it should. This result seems of considerable importance. For, while the distri- 
bution of ¢, approaches closely to a normal curve when n is large*, and the values 
of 8,, 8, for the distribution of 7 approach the values 0 and 3, i.e. the normal 
values, for moderate values of p and values over 50 of n, they do not do so for p 
above 0°9. In other words the limit for n large in the case of (xviii) is not a 
normal surface. The appearance of such a regression curve as (xxii bis) seems 
to me of the highest importance, and provides a warning against the too ready 
use of linear regression for frequency constants. 


* Biometrika, Vol. x. p. 524, 
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If we now integrate the expression in (xxi) for o, we obtain 


n 

rm (")1 

r— Qn-3 j \2 | \ ~ 
NF = 258,82, 2"*p Var P (" +1) 





2 
/ —— Stews cr 4 
x(1 ‘ A oS p* on (3n+1)3n($n+1) p* oY 
$(n+1) 1! $(n+1)}(n+3)_ 2! / 
But the series = F (hn, }n, }(n +1), p*), 


where F(a, 8, y, x) is the usual hypergeometrical series. Using Euler's transfor- 


mation, we find: 


tm n\ r (*= 2 
for » ee 9n-3 J/ ») \ 9 
2882 2° "pp V1 a/ y- 








—_ ee a 
2 r (*4*) F222 +0), P*. 
¢! — p’) ig y 4 y 
Whence dividing by NV from (xx) 
mG) 
iia - ee , oa 
; Perea p (uri F'(3, 3, $(n +1), p’) b bteieecaen (xx111). 
\ 2 ) \ 2 ) 


This agrees with Biometrika, Vol. x1. p. 336, Equation (xxv), if we change the 
notation and observe that the symbol g, there used is the same as B(4, $n). 
It is extremely easy in the same manner to find Ne,. All we have to do is to 


integrate the expression in (xix) for o,do,, i.e. 


ei f? / " ™ : ,({n—2 > —43(1- p(t) o,\"" oO; 
Nz, =| | g'a,drdo,=2,83&%Var2 * | ( 5—)* | e 3) (2) d | ) 
-1J0 a sf JS Ss; \ 51 
2n—-5 
, 2 >, 
2o%'8e Na 2 r n— =) r (") 
(1 — p*)" \2 \27° 
Whence getting rid of z, we find: 
n n’ 
Er ( } Tr | 5) 
8, V2 \2/ eee \2, ; 
ra ae ee eee (Xx1v). 
V1—p p n—1 np n--1)\ 
\ 2 2) 


This agrees with the result (xii) given on p. 180. 
We now proceed to find: 


Np' re, = a, do, | | e. dr. 


J0 J—1 


e£ 


We return to Equation (xxi) and integrate with regard to o,do;: 


(n — 2\ 


n-4 r\ } ~@ e 2 

; / - n\ n p> (a,\° 

i aalis ery x2)y (2) *], f+ wei tea(s) 
2 
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1 2 . 
mosey ot (ait bet (8) (ata (2 
(n + 1)(n+8) 2!2? \s, ) a - 
n—2 n 
2n-—5 I = 
2, ) 
= 28822 ? id | pvr 





/ n+1 pn p n+1n+3 n(n +2) p* ) 
2 _ 2? + . 
Nir? SS" Sosa 2 2 (n+1)(n+3) 212 
2n-5 > 
28.8; p Vw 2 . r(- ) (5) 
Or: Np’ ve, = - = a 
(1 — p?)? 


"rs ) 


We have already given in this paper the value of o,,= 2,(n—1—2A rn) | In, 
see Equation (xiv). In Biometrika, Vol. x1. p. 335, the value of o,? is given, but it 
may be well to deduce it by the methods here used. Clearly we must integrate 
2 for r°dr, or better for (1 —(1 —7*)) dr and then for do,. It is seen at once that 
only the first series on the right of Equation (xviii) concerns us for the second con- 
tains odd powers. Accordingly we have 


+1 
N (¢2 + 7) = | 2 (1-—(1 —7))drdo, 
=i 





n-2 xn-3 1 o,\* 
rl “TL = - —_ L\ —-%3 - oc n 
= N = 229% | (1 — r*) > 2 2 ] (" 5 ) e (2 ) (=) 
J0 /0 = , 
a: (n—1) , .(%1\ (n—1)(n+1) rip! (2) lard 
T “Tig Pah 4! rae ee 
We have now 
2 
-y —-— 1 2p 2s n+1 
2 | (l—?r) * rvdr= r(*3 nd r 5)/ r >! }; 
0 iad ; 
and accordingly 
n-3 [ i } 75 *) Vor po 4} (ey 7,\n-2 
N(oZ+P)=N—252 7 .— = Mi sl ') 
N(o2+?7*) (n+1\ I, ' - 
I ( a 
3 ,e-1 (Gy 3 1 gu—1(n+1) J) +...be . 
1+3 wei nyt 2"? 4! (3 n+1n+3 i 


2 2 2 
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We can now integrate with regard to o, and we have: 


aft\ os n—1) 
r(5)! \>) 





N (624+) = N —%8,8, 2" Var 


n-ln-|]1 p” ] n-l1n +] 7) n+1 i l 
ee ae p ; 
+3 “¥ tael’ = ¢ ¥ YT weet tT 
. 2 2 


—1 
r (5 r n 
‘ >) \ 2 ) n—-1 n-1 n+1 


= / 
= LV — 2)8,8. Var 2"~* 4 as 
N 05) 8 ‘ in + l I ( 2 ’ 9 ? » ’ P*), 





or by Euler’s transformation : 


a) (n 9 ip ant 

Zy 8 82 Wor Qn-3 I (3) | ( 9 ) 
n-3 r n+ 1\ 
N ( l —p*) 2 ( 5 


Substituting for V from (xx), we find: 


,(n\ ~~ (rn—-1 
rr (Cs) 
~F(1, 1, 2 (n+ 0), p*) 


,(n—2\ , (nt+1 
i a le 


—* Fa, Ds BOE OAD cccccirecessecsins (xxvl). 


of +r=1— 








of =1—7—-(1—p?*) 





ti 
=1-P-(1-/° 
\ P’) n 
With a different notation this accords with the result in Biometrika, Vol. x1. 
p. 335, Equation (xx) bis. 


We can now write down the value of j,<, =D re, — 6, from Equations (xxiii)— 
(xxv) and we find finally, if 


r n+1 \\ 
—— i eon 2 2 )\ | 
Die (1 ao" (2, “2 fF )) 
that Tro,= P 


n—1—2),2 _ ...(xxvi bis). 





Noting (xxiii), this reduces to: 


2 (p ae r) wt 
Tray = A Teer eee Pee (xxvll), 
n—-1—2n,? o; 
and o, must be found from (xxv). 

As the values of 7 and o, have been already found* and the radical has been 
computed for our earlier work in this paper, there is not much difficulty in 
obtaining 7,¢,. The following table gives the numerical values, and shows the 
degree of approximation to the usual value p/¥ 2. 


* Biometrika, Vol. xt. pp. 372, 379—401. 
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TABLE II. Values of the Correlation of the Standard Deviation of One Variate with the 
Correlation of Both Variates in Samples of Different Sizes taken from a Normal Population. 


Values of p. 





| 
i 
| 

e. 4 J 2 ‘2 
| 


i ¢ | 


> 
= 
~ 
S 
~ 
~ 
S 


. 


“0000 | -0481 “0958 "1431 "1893 “2340 *2762 “3141 *3438 | °3532 
2299 


“0000 
3 “0000 | -0581 “1156 °1726 *2284 22 *3328 “3779 *4122 | °4188 “0000 
4 “0000 | -0620 *1237 "1847 *2445* | °-3026 *3575* | *4073 "4469 | *4608 “0000 
5 “0000 | -0641 "1279 1911 *2533 2138 “3717 *4250- | -4696 | °4931 “0000 
6 0000 0654 *1305* | °1952 "2586 *3210 3809 "4369 "4857 | 5179 4134 
7 “0000 = -0663 "1323 "1979 "2626 5260 3874 *4455- | *4876 | °5368 “5069 
8 


“0000 -0669 "1336 "1998 "2653 “3297 *3922 "4519 
: “0000 | “0674 °1345* | °2013 2674 3325 "3959 
10 “0000 | -0677 1353 °2025- | °2691 *B347 


*5067 | 5512 *5550- 
"4569 | *5137 | °5624 *5845 * 


“3989 “4609 *5194 | °5712 “6045 * 
11 “0000 | -0680 "1359 "2034 *2704 33657 | *4013 "4642 5240 | 5783 | 6189 | 
12 ‘0000 | -0683 "1364 *2042 °2715~- | 3380 *4033 


“4669 "5278 | ‘5841 *6298 
3 "0000 | -°0685- _-1368 2048 2724 *3392 *4050- | 4692 *5310 | 5888 6383 
4 0000 | -0686 °1372 2054 *2732 3403 4064 "4712 “5338 | °5928 6452 | 
15 ‘0000 | -0688 °1375- | -2059 °2739 3412 "4077 *4729 


Size of Sample. 


| -5361 | 5962 6508 
16 ‘0000 | -0689 "1377 -2063 "2744 *3420 “4088 "4743 «| 5381 | °5992 6554 | 
17 -0000 | -0690 *1380 -2066 2750- | °3427 “4097 "4756 *5399 | °6017 6594 | 
18 ‘0000 | -0691 "1382 -2069 *2754 3434 “4106 "4767 5414 | *6040 "6628 
19 “0000 | -0692 *1383 -2072 2758 "3439 “4113 "ATT7 "5428 | -6058 "6657 
20) “0000 | -0693 °1385+ | -2075- | -2762 "3444 -4120 “4786 "5441 | -6076 “6683 


21 “0000 | -0694 "1386 -2077 °2765- | °3448 *4127 


27 *4795~- | °5452 | 6092 “6706 
22 “0000 | -0694 "1388 *2079 *2768 "3452 “4131 “4802 5462 | °6105* | °6726 
23 “0000 | -°0695- °1389 2081 2771 "3456 *4136 “4809 *5470 | °6118 “6744 
24 “0000 | -0695* +1390 “2083 2773 “3460 “4141 *4815- | °5479 | -6130 ‘6761 
25 ‘0000 | -0696 "1391 °2085- °*2775* | *3463 *4145- | +4820 | 5487 | °6140 6775 ~ 
50 “0000 | ‘0701 "1403 *2104 "2802 *3500* | °4195* | -4886 *bD76 | °6258 6936 


100 “0000 | -0704 *1409 2112 *2815* | °3518 *4219 “4919 ‘5617 | °6314 *7007 
400 “0000 | 0706 1413 *2119 


"2825+ | °3531 "4237 "4942 *5647 | 6350+ | “7055 


: — 
p/V¥2| 0000 | -0707 | -1414 | 2121 «= 2828 | “3536 | 4243 | -4950 | “5657 | “6364 | -7071 





It will be clear to the reader that this table possesses more singularities than 
that for 72,.,- 





The maximum value for r,,., for a sample of definite size n may 
be reached only at p =1-0, or it may in the case of samples of 8 or less be 
reached earlier. Further it may be doubted whether for practical purposes we can 
use the ordinary value, ie. that for “large” samples (p/V2) at n= 25; certainly 
only roughly for p= ‘7 and over. We might possibly for practical purposes fix the 
limit of small samples at n=50. If we want, however, the probable error of such 
a simple constant as the regression coefficient, R =re,/o,, for a small sample it is 
needful actually to know not only {8ré0,}, {5rd}, {Sa,80,}, but we clearly require 
higher products such as {8a,°}, {8rde, d2}, {Sa,50.°}, etc. and also the fourth order 
products to obtain o,? to what is after all only then a second order approximation. 
If we go to the roughest form of approximation only, we have*: 
2 2 2 955 } 9 5 -} 95 ) 
wat WE 5 So Ly SS tet 


* A short quantity ~ over the letter indicates that the value should be that of the population 
sampled. 
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Now such an approximation will show us how far the use of large sample 
values for o,, %,, Fe,» {80,80}, {8o,8r}, {8o,57} modifies the customary value 
of o,. 

To illustrate this let us suppose p= “6 and the sample to be of size 10. Then 
the value of o, for a large sample would be RV1- p(pVN)*. This equals for 
our case *4216 R. 


We will now insert in the above formula the values of the several terms: 


, QR 
ep Oe. setae 
= — = 3925 ft, 


p= 
Fer _ -9903+, 7% = -22031 
2203+, 22 
a; ° o» Ki 


{80,802} = 04, 00,1 10, = F152 (2203)? 3537 = 6,4, 0171,6580, 
(80,67 } = 06,0; Type, = PF (3925) (°2203) 3989 = FF, 0344,9199, 


{80,87 | = 04,0 Tro, = P Fz (3925) (2203) 3989 = pe, ‘0344,9199, 


these last three values being deduced from the present paper. Accordingly the 
terms in {ée,6r} and {8c,8r} cancel and we have 
o,2 = R? (1540,5625 + 2 x -0485,3209 — 2 x 0171,6580) 
= R x -2167,8883, 
or op = 4205 R. 

This is so close to -4216R found for o, from the usual formula for large samples, 
that it is clear that grave error would not be introduced by using the ordinary 
values of o,/) and o,/o, together with that for {d0,5e,} in the case of n= 10. It is 
a different question, however, whether it is legitimate to neglect the contributions 
of the third and fourth order products to o,?, when 1/N is of the order 1/10. The 
answer to this question cannot however be provided until these higher products 
have been studied and we know their numerical values for small samples. We 
shall return to this point in our third section. 

For the special case of r,,, when p= 1 we require to ascertain a limit. For in 
this case 7 will equal p= 1, and a, will be zero so that the ratio (p — 7)/o, takes the 
indeterminant form 0/0. In order to evaluate the limit we require to expand 7 
and o, in powers of ¢ where p? = 1 - and ¢ is made very small. First let us con- 
sider 7 in Equation (xxiii), we have 


7=V1 —evy, F(, i 3 (n +1), 1l-—e), 


rm ~ (n\ /(,(n—-]1 ,{n+1\) 
asia Yo=T(5)/1r( 2 jel ( 2 yy: 


: dF\ &/@F 
Hence 7F=(1—}e—- e+...) \F, -( Za ( eset > 
2c 8 Wr, { 1 da ' 2 da? { 
Phil. Trans. Vol. 191 A, p. 245 or Biometrika, Vol. 1x. p. 9. 
+ Biometrika, Vol. x1. p. 386. 
t Ibid. Vol. x. p. 529. 
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: uF) . sad : 
where F, stands for F (3, 3, }(~+1), 1) and (5 :) for the sth differential 
Le yy 
coefficient of F'(3, 3, $(n +1), x), # being put unity after differentiation. 
But it is well-known that 49 
C(y) UC (y-—a— P) 
F a, ; ; l)j= = 7 > 
Bs I(y—a) U(y—8) 
or we have 


F.G,4,3(n4¢D,1)= Ps(r—-1)TEm+ ), ae 


| rt) 


a ar ee are {,_ 1 /dPy L (OE 6 
Thus r=(1 se— 4¢ + ...) ‘ F (as) ¢*+ ar da® ley —_ 


Further it is easy to show from the differential equation of the hypergeometrical 
series that: 


l ed A a8 Sie iat 
F, de), y-a—-B ee "2 (n- 8) n case, 
1 oF aB(aB+a+B+1) _9 1 aed 
Ff, (aes), (y-a—B-—1)(y—a-—B-2)’ 4(n—3)(n—5) _ 
Y Y 


Hence we find 
1 2n—-1 


pF Se —-8)' 8@-3)@-5)°" 
2 9 > 
: ee 4-2, (n—2)(n—38) ‘ 
and F=1 23° jaan’ 
Accordingly 
—_ n—2 (2n—7)}(n—2) , 
= = _ ; ee 
*a-8 8a-8G-5 
We now require 
=e 
(1 —,*)- F(A, 1, 4a +), p*), 
| | ied 
n—2 ( (dF; 2k, ) 
; | pel) Ke t) mah 
= “n—1 Ve ¢(a),+ 2 \ dat 1 } 


ds if ae : ig : 
where F,= F(1, 1, $(n +1), p*) and (2) =the sth differential coefficient of 


a) 
1 


F=F,(1, 1, }(n+1), z) with « put unity after differentiation. 


’ 7 n+ 1)\ ,/n—] f/_/n— 1 n—I 
But F,=0( 2 JE 2 yr 2 }+==> 


7. a. 
and F(a)" Pay a 


Thus from (xxvi) we have 

ors 2(n—%) = Sse — 
: \(n—3)(n—5) 2(n—3/P(n—5)) 
a 2)(2n — 5) 


(n— 3) (n—5)- 
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Hence from the values provided on p. 191: 





od : (n—5) < ew 
bs oe "V Te -DGans) (xxix), 
a ~ (n=3) a 
or “@, for — o / cs —. / a : ererrcr © © 6 oA 
; ry er \ n—1—2n,7 2 (n — 2) (2n — 5) : 


When n=5 this limiting value is zero. For n=6 and higher values, it is 
always real and positive. For n= 2,3 and 4 this method of approaching the limit 
fails—th .alues are imaginary. These three cases require separate investigation. 

) 
In the case of n = 2, 7 = — sin“'p * and if p=1—A, we have: 
T 


e T a 
an~p= — 9 gn 
sin~'p g-* sin / 5° 


Therefore r=l1- 


But for n = 2, of=1—FrF, or 
, SrA 8 
G;= (5 ) ——A+ 
w\2/ TT" 
ai == 
Accordingly p = VV2r ( 1y)! ME. x scaa aed ee emeneameween (XxX1), 
Cc; - 


and this is zero when X= 0. The important point is that * starts with powers of 
AF and not with integer powers. 


We next turn to the case of n= 3. Here we havet: 


— eo . ; 
F =—(E,— (1—p*)F) | 
a Se, eee (xxxi1), 


—_ 3 
of=l1— r+}. Ptoge(1 — 6°) | 
p ; 


where /, and F, are the complete elliptic integrals for k*=p*%. The difficulty of the 
limit here is that as p approaches unity Z, approaches unity and F, approaches 
infinity. But without proceeding to a limit we have by the tables of the complete 
elliptic integrals : 

p = 95, F ="894,94275§ , 0, = *270,79898, 

p= '999,8477, 7=°999,2484 , o, =°016,41104, 

p ='999,9985, 7 =-999,987,813, o, =002,30196, 


* Biometrika, Vol. x1. p. 360. + Ibid. p. 361. t Ibid. p. 363. 
§ Erroneous numerical values for these constants are given, Biometrika, Vol. x1. p. 368. 
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giving respectively 
rg = 38895, ="06986 and =:01013, 
showing the rapid approach to zero of the value of 7,, as p approaches unity. 
We next turn to the case of n = 4. 


Here if p = sin a, then 


9 \ 
7 =— {cot a+a(1 —cot? a)! | 
Tv 


esidecinhcn eben (xxxlll). 
and o-=1—-7r—2cot?at+ 2a cot? al 
Now let a= = y, Where y is a small quantity. Then 
p=cosy= ] -7 acai 
.. tr f= l ry) 
7 _ ‘os Y+\3 y) — tan a 
= sea (F )a-y+ )\ 
a litietl|s -—% iia 
2 3 ‘ 
l ‘Ko @ 
Y Bar 7 uae 
, " fo 
Accordingly 1-7 =2,'- Pe ae 
p-Tr= Jey? — terms in ¥’*, 
7 
o2=1—7—2 tan?y +2 | 9 -y) tan® ¥ 
—_ = 3 7  - 
lw ty ts, 
_— , 1,3 ° 
Hence ae a Pe char ete skeen (xxxiv), 
p=1 9 p=1 / 16 
NV ‘—hl 37) 
=U. 


Or we conclude that for n=4, 7,.,=0. We thus see that for all the cases in 
which our limit to (9 —*7)/e, becomes imaginary a direct investigation shows 
that 7,¢,=0. We now return for a moment to the method by which our limit for 
n > 5 was obtained, and we ask: Is the expansion by Taylor’s series of the hyper- 
geometrical legitimate? The doubt which exists in the process is this: The differ- 
entials of the hypergeometrical series for the values of a, 8 (i.e. both 3 or both 1) 
and y=4(n +1) will ultimately become indefinite for some one or other value of 
n. Can we therefore use Taylor’s theorem at all? I think we can. For if we are 
considering the limit when n=s and accordingly a certain differential coefficient 

uBR 
becomes infinite, say d when a=1, we have really to consider what happens to 


pul 
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7 
ya'F : a cas , F “coaches zer r 1aAVS : . 
e" Fu 0 such a case when e approaches zero, and we can always make e approach 
aL 7 
zero by taking it equal to ¢e’(n—s), so that the factor n—s disappears from the 
denominator. For our purposes we need only the first two terms of Taylor’s 
expansion. Thus, if @ be a fraction between 0 and 1, we shall have : 


F, G, 3, $(n +1), 1 -€«) 


vy 1 9 ( 25 ~ 2 rwewty 

sl {1 —~3(n—3)°* 8a— 3)(n— 5) \! ~ 2(n— n°) ‘ retical 
where the term in @ expresses the remainder after two terms. It may be said that 
this will become infinite when xn =7. But as we are going to make e approach zero 
we can take it =(n— 7)’, where e’ is also infinitely small, and we reach precisely 
the same limit as before. It seems therefore that for n>5 our method of approach- 
ing the limit will apply. At the same time a full discussion of the subject by a 
more competent mathematician is desirable. 


ItI. On the Standard Deviation of the Regression Coefficient, ro,/o., and of 
the Standard Deviation of Arrays, o, Vi —?, in samples drawn from a normal 
population. 

As before the frequency surface is 

n-4 


o,\* ‘aa o,\" n—2 
z= a € a(*') e (@) (“122) ev(l—r) ? 


soos hRekva): 
we need to multiply this in succession by 

ro,/a2, (ro,/o2), o,V1—7%, and o,2(1—7°), 
and integrate over the whole volume, Le. 7=1 to —1, o,= 0 to ©, o,=0 to ow. 


We can do this most expeditiously by evaluating the integral of z multiplied by 
rP (1 — 7?) o to. 


ry DO poe 
W 6.40% =| | | zrPo,ha, (1 — 7) drdo,da. 
" 1 0/0 
eats b1 
= 2, 3,0hs30h | emt larttes") go ntn-2g N+? x [ ey? (1 — 2° )k @+t-49) drda,da. 
0 0 J -1 
dacnuuaaaen (XXXV11), 
where for brevity o,/ and oa.’ are written for o,/s, and a./s,. Let 
d 1 : Faas | 2; °2 
+1 
. 9\1 ) see 
Ip.=| ey? (1 — 72h Ott 8 der oo. cece eeees (XXXV111). 
: 


We have to distinguish p odd from p even. 


(i) podd. Take r=sin @ and expand the exponential ; only the odd powers of 
y remain, and 


Tv 


T,¢=2 (y sin?+! @ + sins 6+ sal cos" t-3 9 $6. 


“U0 


/ 
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Put «=sin?0, and we have: 
1 3 

i, t =| (ye x. Y a (p+2) _ >) ( = ay (nt+t—4) dy 
0 oO: 


Ge EGG D(z y p+2 
= (3 (p+n+t)) Ll! 3ipt+n+t 


y (p + 2)(p+4) ) See 
. +5i(ptn+t)(ptn+t+2)* pie EB awteceeen (XXX1X ). 
(ili) peven. We easily find : 
i ss 2 (sin +2, sine'*0 + a cos"t—29d0 
0 a: 
ate ss n+t- 4 
=| (ated + 5,2! rO+...)(l—a) 2 de 
Jo 2! } 
TQ(p SOY (1 y* p+ 
~  P&(ptnt+t—-)) 2!ptnt+t—1 
v3 3 
4 (p+ 1) (p+) +...) eee (xl). 
4! (pt+n+t—1)(p+ntitt+l) 


Now y= poyo./(s,s:)=po,'o,. Hence when we come to integrate these two 
expressions for o,' and @,' we require the integral, in the case of y", of 
pH pes 


U = p" | em 32 gta.” g ntti? g,/ntuta—? dg,'do.’. 
~ oO 0 


Changing the integration to y, and y, by the relations o)7=2y,, o.°=2y., 
we have 


Pies a 


U = ph antuthatay—3 enyt mtutans) dy, x | Evy t mtutn-9 dy,, 
70 70 
or U = pt 2rteHiata Pd (ntutq—-D)PQmt+utq—d) .«..¢xh). 


We have now to apply (xxxix) to (xli) to our special cases. 
(a) Mean value of ro,/o.= {ro,/o.}, say. 
Here p=1,q,=1, q.=—1,t=0. We have: 


(3) ($(n—2)) 


N ire,/e. = s's3 — 2 223 TP (1 (n+1) P(A (™m-1)) 
(Poul Ga TG (a+1)) eee ape See 


- ‘2p 4h 3 93” +ln-1 if 9 3.5 n+1n+3n—-1 n +1 +...) 
3!n+ 1 2 2 5!” inttamas+3) 3 2 y 2 


= 298 20 Var P(E (n—2)) PG —I))G a pe ehWe ee (xlii). 
— p*)} 


Hence, using (xix), a 
{9° /Gg} = 28, / Sq = P2q] Dig.o 2... .0cereeeeseseressees (xliii), 
which is the expression for the mean slope of the. regression lines of all samples. 
But as r is not symmetrically distributed about the value p and the mean value 
of r, i.e. 7, is not p (see (xxiii)), we could not @ priort assume that the mean value 
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of (ra,/e,) would be p>,/>=., the slope of the regression line in the sampled 
population. 

(b) Mean value of (ra,/o,)? = \(r0,/0.)"}, say. 

Here we must put p= 2, q,=2, q=—2,t=0. We have: 

i = fr 3 r (4 (n - ut | )) c (4 (n + “u— 3)), 

and 
a (3) rs (n — 2)) 
[ (3 (n+ 1)) 


N {(ro,/o2)} = 2 2" [1 ($ (n+ 1)) C(3 (n-3)) 


dp?n—3 spin—-3n-—1LTp® n-3n-1n+1 
ot het ie ad ae ae a et oe Meee i 





But the series on the right in square brackets is 
é , | 2)—} a) _1+(n—4) Pp 
dp ?' p ie — prim * 
Thus, using (xix) again : 
ae 
{(r0,/02)"; = 4 (*) ri 


and, using R to denote ro; ‘Gat 


Ge ayy 1 +n — 4D0*) pect er eeuee (xliv), 


n—1)) 


° ta 9) cs le 
Op = ((104/02)"5 — {1O4/F2} 


>2 (i (n—8)) : 
— * — ] % ay P = = 
Sarge + (n )p°) p*) 


T2/l+(n-4)p? \ Tel-p? 
= 


=3 —_— > — - ° 
‘ n—3 P) y+ n—3 
] > 
“Hi , a on} : 
Accordingly op=~ = VL Ap? cee ceceeeeeteeeereereeeees (xlv) 
VR — 3 <2 
Now the value usually taken for large samples* is: 
ee 
2 - 
lene ot ed te! Me (xlvi) 
Vn 2 
Thus the error in applying this to small samples lies solely in using for 
Vin 
l ‘ : B : 
For the case of p=*6 and n=10 (see our p. 190). the true value of 
Vn—3 


o,='5040R, while (xlvi) gave ‘4216R and the method of our p. 190, 4205 R. 
The error therefore in applying (xlvi) to compute a, does not lie in using the 


wrong values {60,60,}, {drdo,', {Srdo,} for small samples, but in neglecting terms 


in higher products in (xxviii). The formula (xlv) is as simple as that for large 


samples, and I have not considered it needful to provide a table. The value under 


the radical is somewhat unexpected and suggests that special treatment is required 
01% , o2\* 

. P ° - ‘ ,_o, Pr am oe ) 

for n= 3. In this case we have really to find the integral of ~e “*%e —\*/ and 

) a 


* Phil, Trans, Vol. 191 A, p. 245 or Biometrika, Vol. 1x. p. 9. 
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the limits being 0 and #, we approach at the lower limit the infinite area of a 
rectangular hyperbola. The fact is that for n= 3, the frequency of cases of o,=0 
is finite, and the second moment of the regression coefficient becomes infinite. The 
case is still more exaggerated for n = 2. Hence we conclude that it is not possible 
to determine the mean slope of the regression line for samples of less size than 
four. 

(c) Mean value of o,V1—r?= {oq}, say. 


Here p=9, n= 1,q.=0,¢=1, 


yr 


U s p* Qete-sti (s(n uy PCS (a t+u-—1)), 


and we find: 
. U(r m— Dd) 


Nog =% 878s : 22°"? T(in) 0 (4 (n—-1)) 
7 (dn) ° 7 


me p n—I p* n-lnt+l] , Pr n—-ln+1n4+3 \ 
a 2 ets F nS ee 


and this series is again (1 —p*y re, 
Hence using (xix) we find : 
{o4| =V2s, Cr (s(n —1) C(4 (n— 2)) 


(d(n—1)) 


=>, Vl-p? —— 
P’ (n/2)! F.(4 (n — 2) 
a a ee, | See enero (xlvii), 


in our previous notation. Thus the mean value of the standard deviation of arrays 
is not the value in the sampled population, until » is so large that we can take 
a Vin as substantially unity. In this matter the mean of the array standard 
deviation differs from the mean of the slope of the regression line. I give a table 
by which the approach of {o,} to o,, the standard deviation of arrays in the 
sampled population, can be judged. 

TABLE III. 


Ratio of Mean S.D. of A rrays in Samples to Array S.D. in Sampled Population. 





Size of Ratio Size of Ratio Size of Ratio Size of Ratio 

Sample | {¢,}/o. Sample | {o,!/o., Sample | {¢,}/o, Sample | {oy}/o4 
2 “0000 9 “8511 16 ‘9189 23 "9442 
} *4607 10 "8670 1? *9238 Ph “9466 
Y *6267 11 ‘8798 18 "9282 25 "9488 
) ‘7136 12 "8904 19 9321 50 ‘9747 
6 ‘7675 13 “R992 4) "9356 100 "9874 
” "S042 14 ‘9068 ?1 “9388 4,00 ‘9969 
8 “8308 15 "9132 £4 ‘9416 


Clearly in this case we can hardly speak of 








practical purposes even—before n= 50 to 100. 








“small 


samples” 


ceasing 


for 
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(d) Mean value of o2 (1 —7*) and standard deviation of the standard deviation of 


arrays in samples. 


Here p =0, q, = 2, q. = 0, t= 2 


; and 


U=p" 2" Tig (nt u+1))C4(m4+u-—1)). 


Hence N {o7(1—- r); = Zo 8,28, N 


x (1 +08 


and the series =(1 — p*)~* ita 
We have using (xix) : 


io," (1—?7"*)} 


= 8) 


/ 


3 = 


~ Toy 
oa: 


a2"? PT (dn) C4 (™m—1)) 


5) » 


2h (dn) CG (n—1)) 
Pr ($(n— 2)) PQ (w—-1)) 


= 87 (n —2), 


or jo (1 — 7") =O — p’) 


t] 


n—-2 


+1 p'n—1lnt+1ln+3 
-_ eo — = 


4 ds 


....(Xl vill). 


Thus the mean variance of the arrays in samples is not the variance in arrays 
of the sampled population, but the latter multiplied by the factor (n — 2)/n. 


To find the standard deviation of the standard deviation of arrays, we have : 


Os, = Pg OL —71)} — jo, V 1 — id 


or o 


oa . n—2 
an (8 — p*).( 


2 


i 


eR SOIREE (xlix). 


Clearly =, V1 — p?/V2n is the value we should have given to o,,0n the theory of 


large samples. Calling this value (o,,), I have computed the values of the ratio 
Fo,/(Fo,), and they are given in Table IV. 


TABLE IV. 


Values of the Ratio Fo,/(F 


Size of : Size of 
Sample Ratio Sample 
4 “0000 9 
o *85257* 10 
l ‘9265 * 11 
7) ‘9524 12 
6 ‘9651 13 
? 9725 4 1h 
8 ‘9774 15 





Ca 


Ratio 


“9808 


‘9834 
"9853 
“9868 
“9881 
‘9891 
“9900 


), for Various Sized Samples. 


Size of 
Sample 


16 
17 
1s 


19 





Ratio 


"9907 
*9914 
‘9919 
“9924 
“9928 
9932 


“9936 


Size of 





Sample 


Ratio 


*9939 
“9942 
“9944 
‘9974 
‘9987 
*9997 


It is clear that for most practical purposes we may suppose that for a sample of 


25 our ratio may be taken as unity. 
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It must be confessed of course that the resuits worked out in this paper only 
cover a limited series of sampled populations, i.e. those which follow the normal law. 
Their value is therefore rather for the light they throw on the general nature of 
small samples than on their direct application to any given series of small samples. 
We are a long way yet from a determination of the variation limits in the case of 
bi-variate small samples from any indefinitely large population whatever its law 
of distribution. Even in the present cases it is seen that the frequency distri- 
butions of the constants in samples have very complicated surfaces* of which it is 
not easy to get a clear geometrical conception. 


For much help in the calculation of the tables I have to thank my colleagues 
Miss Ethel M. Elderton and Miss M. Moul; for computing the 2, functions I am 
indebted to Miss B. N. Stoessiger, a statistical student in the Laboratory. Without 
their assistance the publication of this paper would have been indefinitely delayed, 
and I am most grateful for the aid they have given in the matter. 


* There is some hope of expressing by a finite function the distribution of regression coefficients. 
3ut in this case the higher moments of the distribution will ultimately become infinite for small 
samples, 


[While preparing this memoir from my notes, I received a paper from Professor 
Romanovsky. Writing without knowledge of the papers in Biometrika by “Student,” 
Fisher, myself and others (Vol. vi. pp. 1—25, Vol. x. pp. 507—521, Vol. x. 
pp. 522—529, and Vol. x1. pp. 328—417) and naturally without knowledge of my 
present paper, Professor Romanovsky had reached, dealing only with the algebraical 
side, many of the published results and certain additional ones. While willing to 
publish the latter, the present cost of printing prohibited the reproduction of 
much work already published or about to be published in this Journal. Recog- 
nizing the difficulty of an Editor in such matters, I sent him a proof of this paper 
and asked him to cable if he were willing that I should add under the title his 
name to my own. I have at last received a letter from Professor Romanovsky. He 
is satisfied with the statement that many results contained in the present and 
earlier papers have also been obtained by him quite independently and by a 
different method. I trust for the sake of his additional results that his paper may 
shortly be published elsewhere. This note explains the delay in the issue of the 
present parts of Biometrika, K. P. June 25, 1925.] 
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